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REQUEST 



The undersigned requests that the present 

international application be processed 
according to the Patent Cooperation Treaty, 



For receiving Office use only 



International XpplD^^ No. ^ 1/^-^287 



U FEB 2001 

International Filing Date L 



PCT INTERNATIONAL APPUCATION ROAJS 

Name of receiving Office and "PCT International Application" 





Applicant's or agent's file reference 1452 PCT-00 
(if desired) (12 characters maximum) 


Box No. I TITLE OF INVENTION 

Expression of an Antimicrobial Peptide Via the Plastid Genome to Control Phytopathogenic Bacteria 


Box No. II APPLICANT 


Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below,) 

Auburn Uniiversity 
309 Samford Hall, 
Auburn University, AL 36849, US 


1 1 This person is also inventor. 


Telephone No. 
334-844-4977 


Facsimile No. 
334-844-5963 


Teleprinter No. 


State (that is, country) of nationality; 
US 


State (that is, country) of residence: 
US 


This person is applicant all designated K/| all designated States except 1 1 the United States 1 1 the States indicated in 
for the purposes of: L — 1 States the United States of America 1 1 of America only 1 1 the Supplemental Box 


Box No. Ill FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 


Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is. country) of residence if no State of residence is indicated below.) 

University of Central Florida 
4000 Central Florida Blvd.. 
Orlando. FL 32816. US 


This person is: 

[X| applicant only 

1 1 applicant and inventor 

1 1 inventor only (If this check-box 
is marked, do not fdl in below.) 


State (that is, country) of nationality: 
US 


State (that is. country) of residence: 
US 


This person is applicant f 1 all designated PyH all desi^ated States except 1 1 the United States 1 1 the States indicated in 
for the purposes of: b — F States ^1 the United States of America 1 I of America only 1 1 the Supplemental Box 


1 1 Further applicants and/or (further) inventors are indicated on a continuation sheet. 


Box No. IV AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 


The person identified below is herebv/has been appointed to act on behalf ItT! 

of the applicant(s) before the competent International Authorities as: 1 1 ^^ent |2$J common representative 


Name and address; (Family name followed by given name: for a legal entity, full official 
designation. The address must include postal code and name of country.) 

WEISER, Gerard J. 

1600 Market Street, Suite 3600, 

Philadelphia. PA 19103-7286, US 


Telephone No. 
215-751-2427 


Facsimile No. 
215-568-6946 


Teleprinter No. 


1 1 Address for correspondence: Mark this check-box where no agent or common representative is/has been appointed and the 
1 1 space above is used instead to indicate a special address to which correspondence should be sent. 
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Continuation of Box No. Ill FURTHER APPLICANTS AND/OR (FURTHER) INVENTOR(S) 


If none of the following sub-boxes is used, this sheet is not to be included in the request. 


Najnc &nu flodrCSSI (PqihUv name followed hv divert namp' for n Ip^nl pntitv full nfflrinl H^^iayintinfj 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below,) 

DANIELL, Henry 

1255 Marina Point. #315 

Casselbeny, FL 32707, US 


This person is: 

I 1 applicant only 

1 1 applicant and inventor 

IXl inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 
US 


State (that is, country) of residence: 
US 


This person is appHcant all designated 1 1 all designated States except the United States | 1 the States indicated in 
for the purposes of; I — 1 States 1 1 the UniTed States of America of America only 1 1 the Supplemental Box 


Name and address: (T^am/Vy name followed by given name; for a legal entity, full ojficial designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below.) 


TTiis person is: 

I 1 applicant only 

1 1 applicant and inventor 

1 1 inventor onlv (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 


State (that is, country) of residence; 


This person is applicant 1 I all designated 1 1 all desi^ated States except 1 1 the United States 1 1 the States indicated in 
for the purposes of: 1 — 1 States 1 1 the United States of America 1 1 of America only 1 1 the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is. country) of residence if no State of residence is indicated below.) 


This person is: 

1 1 applicant only 

I 1 applicant and inventor 

1 1 inventor onlv (If this check-box 
is marked, do not fill in below.) 


State (that is. country) of nationality: 


State (that is, country) of residence: 


This person is applicant 1 1 all designated 1 1 all desi^ated States except 1 1 the United States 1 1 the States indicated in 
for the purposes of; 1 — 1 States 1 1 the United States of America 1 1 of America only 1 1 the Supplemental Box 


Name and address; (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant's State (that is, country) of residence if no State of residence is indicated below.) 


This person is: 

1 1 applicant only 

1 I applicant and inventor 

1 1 inventor onlv (If this check-box 
is marked, do not fill in below.) 


State (thai is, country) of nationality: 


State (that is, country) of residence: 


This person is appMcant 1 1 all designated 1 1 all desijgnated States except 1 1 the United States 1 1 the States indicated in 
for the purposes of; 1 1 States 1 1 the United States of America l_J of America only 1 1 the Supplemental Box 


1 1 Further applicants and/or (further) inventors are indicated on another continuation sheet. 
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Box No.V DESIGNATION OF STATES 



The following designations are hereby made under Rule 4.9(a) (mark the applicable check-boxes; at least one must be marked): 
Regional Patent 

Kl AP ARIPO Patent: GH Ghana, GM Gambia. KE Kenya, LS Lesotho, MW Malawi, MZ Mozambique, SD Sudan, SL Sierra 
Leone, SZ Swaziland, TZ United Republic of Tanzania, UG Uganda, ZW Zimbabwe, and any other State which is a 
Contracting State of the Harare Protocol and of the PCT 

Kl EA Eurasian Patent: AM Armenia, AZ Azerbaijan, BY Belarus, KG Kyrgyzstan, KZ Kazakhstan, MD Republic of Moldova, 
RU Russian Federation, TJ Tajikistan, TM Turkmenistan, and any other State which is a Contracting State of the Eurasian 

Kl EP European Patent: AT Austria, BE Belgium, CH and LI Switzerland and Liechtenstein, CY Cyprus, DE Germany, 
DK Denmark. ES Spain, FX Finland, FR France, GB United Kingdom, GR Greece, IE Ireland, IT Italy, LU Luxembourg, 
MC Monaco, NL Netherlands, PT Portugal, SE Sweden, and any other State which is a Contracting State of the European 
Patent Convention and of the PCT 

Kl OA OAPI Patent: BF Burkina Faso, BJ Benin, CF Central African Republic, CG Congo, CI Cote d'lvoire, CM Cameroon, 
GA Gabon, GN Guinea, GW Guinea-Bissau, ML Mali, MR Mauritania, NE Niger, SN Senegal, TD Chad, TG Togo, and any 
other State which is a member State of OAPI and a Contracting State of the PCT (if other kind of protection or treatment 
desired, specify on dotted line) 



National Patent (if other kind of protection or treatment desired, specify on dotted line): 





AE 




AG 




AL 




AM 




AT 




AU 




AZ 




BA 




BE 




BG 




BR 




BY 




BZ 




OA 


IS 


CHj 




CN 




CR 




CU 




CZ 




DE 




DK 




DM 




DZ 




EE 




ES 




FI 




GB 




GD 




GE 




GH 




GM 




HR 




HU 




ID 




IL 




IN 




IS 




JP 




KE 




KG 




KP 




KR 




KZ 



United Arab Emirates Kl 

Antigua and Barbuda Kl 

Albania Kl 

Armenia Kl 

Austria Kl 

Australia Kl 

Azerbaijan Kl 

Bosnia and Herzegovina Kl 

Barbados Kl 

Bulgaria Kl 

Brazil Kl 

Belarus Kl 

Belize Kl 

Canada Kl 

nd LI Switzerland and Liechtenstein 8] 

China Kl 

Costa Rica Kl 

Cuba Kl 

Czech Republic Kl 

Germany Kl 

Denmark Kl 

Dominica Kl 

Algeria Kl 

Estonia Kl 

Spain Kl 

Finland Kl 

United Kingdom Kl 

Grenada Kl 

Georgia Kl 

Ghana Kl 

Gambia ^ 

Croatia Kl 

Hungary Kl 

Indonesia 

Israel ES 

India Kl 

Iceland Kl 

Japan Kl 

Kenya Kl 

Kyrgyzstan Kl 



LC Saint Lucia 
LK Sri Lanka 
LR Liberia 

LS Lesotho 

LT Lithuania 
LU Luxembourg 
LV Latvia 

MA Morocco 

MD Republic of Moldova 

MG Madagascar 

MK The former Yugoslav Republic of Macedonia 
MN Mongolia 

MW Malawi 

MX Mexico 

MZ Mozambique 
NO Norway 

NZ New Zealand 

PL Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SD Sudan 
SE Sweden 
SG Singapore 

SI Slovenia 

SK Slovakia 

SL Sierra Leone 

TJ Tajikistan 

TM Turkmenistan 

TR Turkey 

TT Trinidad and Tobago 

TZ United Republic of Tanzania 

UA Ukraine 

UG Uganda 

US United States of America 

UZ Uzbekistan 

VN Viet Nam 

YU Yugoslavia 

ZA South Africa 

ZW Zimbabwe 



Democratic People's Republic of Korea Check-boxes reserved for designating States which have become 

Republic of Korea party to the PCT after issuance of this sheet; 

Kazakhstan Kl .CP.-. .^](>^.^( ft. 1 

Precautionary Designation Statement: In addition to the designations made above, the applicant also makes under Rule 4.9(b) all 
other designations which would be permitted under the PCT except any designation(s) indicated in the Supplemental Box as being 
excluded from the scope of this statement. The applicant declares that those additional designations are subject to confirmation and 
that any designation which is not confirmed before the expiration of 15 months from the priority date is to be regarded as withdrawn by 
the applicant at the expiration of that time limit. (Confirmation (including fees) must reach the receiving Office within the } 5 -month time limit.) 
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Supplemental Box If the Supplemental Box is not used, this sheet need not be included in the request. 



J. If, in any of the Boxes, the space is insufficient to furnish all the information: in such case, write "Continuation of Box No. ..." 
[indicate the number of the Box] and furnish the information in the same manner as required according to the captions of the Box in 
which the space was insufficient, in particular: 

(i) if more than two persons are involved as applicants and/or inventors and no "continuation sheet" is available: in such case, 
write "Continuation of Box No. Ill" and indicate for each additional person the same type of information as required in Box 
No. III. The country of the address indicated in this Box is the applicant's State (that is, country) of residence if no State of 
residence is indicated below; 

(ii) if, in Box No. II or in any of the sub-boxes of Box No. Ill, the indication "the States indicated in the Supplemental Box" is 
checked: in such case, write "Continuation of Box No. 11" or "Continuation of Box No. Ill" or "Continuation of Boxes No. II 
and No. Ill" (as the case may be), indicate the name of the applicant(s) involved and, next to (each) such name, the State(s) 
(and/or, where applicable, ARIPO, Eurasian, European or OAPI patent) for the purposes of which the named person is 
applicant: 

(Hi) if in Box No. II or in any of the sub-boxes of Box No. Ill, the inventor or the inventor/applicant is not inventor for the 
purposes of all designated States or for the purposes of the United States of America: in such case, write "Continuation of 
Box No, 11" or "Continuation of Box No. Ill" or "Continuation of Boxes No. II and No. Ill" (as the case may be), indicate the 
name of the inventor(s) and, next to (each) such name, the State(s) (and/or, where applicable, ARIPO, Eurasian, European or 
OAPI patent) for the purposes of which the named person is inventor: 

(iv) if, in addition to the agent(s) indicated in Box IV, there are further agents: in such case, write "Continuation of Box No. IV" 
and indicate for each further agent the same type of information as required in Box No. IV; 

(v) if, in Box No. V, the name of any State (or OAPI) is accompanied by the indication "patent of addition," or "certificate of 
addition f" or if, in Box No. V, the name of the United States of America is accompanied by an indication "continuation" or 
"continuation-in-part": in such case, write "Continuation of Box No. V" and the name of each State involved (or OAPI), and 
after the name of each such State (or OAPI) , the number of the parent title or parent application and the date of grant of the 
parent title or filing of the parent application; 

(vi) if in Box No. VI, there are more than three earlier applications whose priority is claimed: in such case, write "Continuation . 
of Box No. VI" and indicate for each additional earlier application the same type of information as required in Box No. VI: j 

if, in Box No. VI, the earlier application is an ARIPO application: in such case, write "Continuation of Box No. VI", specify 

(vii) the number of the item corresponding to that earlier application and indicate at least one country party to the Paris 
Convention for the Protection of Industrial Property or one Member of the World Trade Organization for which that earlier 
application was filed. 



2. If, with regard to the precautionary designation statement contained in Box No. V, the applicant wishes to exclude any State (s) 
from the scope of that statement: in such case, write "Designation(s) excluded from precautionary designation statement" and 
indicate the name or two-letter code of each State so excluded. 

3. If the applicant claims, in respect of any designated Office, the benefits of provisions of the national law concerning 
non-prejudicial disclosures or exceptions to lack of novelty: in such case, write "Statement concerning non-prejudical disclosures 
or exceptions to lack of novelty" and furnish that statement below. 

Continuation of Box IV: 



T. Daniel Christenbury 
Paul A. Taufer 
Robert A. McKinley 



Reg. No. 31,750 
Reg. No. 35,703 
Reg. No. 43,793 
Reg. No. 45.269 
Reg. No. 38,940 
Reg. No. 47,042 
Reg. No. P48,052 



Guy T. Donatiello Reg. No. 33.167 

James A. Drobile Reg, No. 19.690 

Austin R. Miller Reg, No. 16.602 
Stewart M. Wiener Reg, No. 46,20 
Michael A, Patane' Reg. No. 42,982 
Sharon Fenick Reg. No. 45,269 



Sharon Fenick 
Joan T. Kiuger 
Felicity Rowe 



Stephenie Yeung 
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Box No. VI PRIORITY CLAIM □ Further priority claims are indicated in the Supplemental Box. 


Filing date 
of earlier application 

(day/month/year) 


Number 
of earlier application 


Where earlier application is: 


national application: 
country 


regional application:* 
regional Office 


international application: 
receiving Office 


item(l) 

29/02/2000 


60/185,662 


US 






item (2) 










item (3) 










Ixx The receiving Office is requested to prepare and transmit to the International Bureau a certified copy 
l^J of the earlier application(s) (only if the earlier application was filed with the Office which for the 

purposes of the present international application is the receiving Office) identifledabove as item^^s): (1 ) 
* Where the earlier application is an ARIPO application, it is mandatory to indicate in the Supplemental Box at least one country party to the Paris Convention for the 
Protection of Industrial Property for which that earlier application was filed (Rule 4. 1 0(b)(ii)). See Supplemental Box. 


Box No. VII INTERNATIONAL SEARCfflNG AUTHORITY 


Choice of International Searching Authority (ISA) 

(if two or more International Searching Authorities are 
competent to carry out the international search, indicate the 
Authority chosen; the two-letter code may be used): 

ISA/ US 


Request to use results of earlier search; reference to that search (if an earlier 
search has been carried out by or requested from the International Searching Authority): 

Date (day/month/year) Number Country {or regional Office) 


Box No. VIII CHECK LIST: LANGUAGE OF FILING 



This international application contains 
the following number of sheets: 



request 

description (excluding 
sequence listing part) 

claims 

abstract 

drawings 

sequence listing part 
of description 

Total number of sheets 



32 



This international application is accompanied by the item(s) marked below: 





l.S 


5 


2-n 






16 


4.n 


2 


5.n 


1 


6.n 


8 


7.n 








9.gl 



Figure of the drawings which 

should accompany the abstract: 2 


Language of filing of the 

international application: English 


Box No. IX SIGNATURE OF APPLICANT OR AGENT 



Next to each signature, indicate the name of the person signing and the capacity in which the person signs (if such capacity is not 
obvious from reading the request). j 




GERARD J. WEISE 



'^^^I^^^^^^^:^'"''''''''^ JC09Rec'dPCT/PTO 2 8 FEB 2001 


2. Drawings: 
1 1 received: 

1 1 not received; 


3. Corrected date of actual receipt due to later but 
timely received papers or drawings completing the 
purported international application: 


4. Date of timely receipt of the required 
corrections under PCT Article 1 1(2): 


5. International Searching Authority to a / / y 
(if two or more are competent): loAy {A 


6. 1 1 Transmittal of search copy delayed 
1 1 until search fee is paid. 



For International Bureau use only 



Date of receipt of the record copy 
by the International Bureau: 
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GENERAL POWER OF ATTORNEY 

(f0t «v-rti/ initrandOflai (ipptk^tiQr^pi^ wW«r IA* Cooperation Tteow 

(PCX Rule 



Autwrn Unlvw»»ty, 309 SatnfonJ Ha«, AUDqm UnivcraHy^ AL 36648, OS 



■snat 



common rcprDScntitivQ 



hereby 4rj>ctnU» ftUowija» p«r»on iwt 



WEiaER. Oerafd J. 
Miflor, Austin R. 
DroDiJa, Jsmes A. 
Mlttv, Auslln R, 
Kluflof , Joan T. 



Reg .No. 10,783 
P»g. Nq. 16.602 
Refl, Net Ifi.fiQO 
Reg. No. 16,602 
ft»g.lMo. 3d.940 
Reg. NO. 4^.201 
Reg.No.P4a,t>52 



Cnristsnbuiv. T. Oani»l Rag, No. 31,760 



McKWcy. RotMtrtA. 
Fenick, Shewn 
PBldri«« Micnaftl A. 
Rowe. P«Ilc)Cy 



R»Q. NO. 43.793. 
Rftg. No. 4i.269 
Reg, No- *2,5«2 
R«g.No. 47,042 



(0 repiwni ihe unU<r»i(jn«J before 



|gJ»U ihc ccwipctcTtt Intemttionil Authoritia 
Qlhf l/UcmiLionol Senrehioj! Auaiori;y only 
Q ihfl Imemuionttl Preliminary Exsjninmg Auihwi^y only 



Jr, donnaciiDn with any and all im«Tni:iolwt applldiciDttt ^llad by ih< «Tl<ler»itlw<J M\h Ift* fbllwing OOltio 
(jg «» rccavina Ornc« 



«nd W nwlcv t>r p»ymcnis on bchAtroftbc undcnit^n^d. 

rJJSjr ifc^^ in y^ich thtpirs<m iignt. tfsu^^ capacity fa not c^^vi^us/^ tta4.n$ tku pc'^r): 



By. 

Printed Nttme:, 
Dttte: 



CMIehaelMortir^ 



A&S6fcta»HWV(Mmn 3 

Vice FrnkJent for Resaaroh 



i 
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GENERAL POWER OF ATTORNEY 

(ft}r jmvrot in/wmatiofta/ oppUcations filtd und^r ih^ Patent Cocper^Qts TrrutyJ 

(PCT Rule 90.5) 



The undersifnod person{a): 

(family aum€ foitawitd by |*jVdn namt: for a fuU ifgaf enliey, Jm(1 official tloit^naU^n, The atUlnus must incMe ficxta! code and 
immf of country.) 

Univer»ty of Oentrat Florida. 4000 Central FforitU Qoul«vard, Orlando, Florida 32dl6, US 



h«reby Appoim(3) the followmj^ person as: 



n 



cominort reprcicnuctvc 



(Ftuniiy H0mt Jollowed by givtn tuajnt; ybr o (c^t €nary»futl ojflcial dvrtsnafioit. Thtr address ntwtt inchdf postal codr and namk of 
ctfuntfy^ 



WEISER. C«mrd J. Rag . Nq. 19,763 
Mdler, Austin R. Reg. No. 16.6Q2 



Orobilc. Jamos A. 
M'tllflr, Austin R. 
Kfuger, Joan T. 
W*f9ner. Slowart M. 
Yeung, Stsphonld 



Reg. NO. 19,690 
Reg. No. 16.602 
Rflg. No. 38.940 
Reg. No. «6^01 
Bmq, no. IHa,052 



CbriatAOtwfy. T. Daniel Reg. NO. 31.750 
Donaliotta Guy T. Reg. no. ^.167 



McKinley. Robert A. 
P^xk. Stiaron 
Patane. Michael A. 
Rowe. Felicily 



Reg. No. ^3.793. 
Reg. No. 45*209 
Refl. No. 42.982 
Reg. No. 47.042 



to rcprdcnc ihc undersigned before 



tttl ihc competeni Inicmational Authoriitea 
I [ ifag Inienurfoiuil Sonrchiitg Authority only 
I | lhe lTi(cfn»nOd«l PrclimiiUry Eiuuniiuns Authority Only 



in connccifon wiUi wiy and all intorn^tKmal ■ppUc«lions flicd by the luidcrsiencd wiih (he roltowin$ OfTice 
US 



as receiving Office 



aiftd lo m»kc or receive payutents on balialf of ifae MmJcratgn^U. 



S(ciialure(t) (whera therx arc xctnirat persons, each t^fthrm must xi^n; next to racA it^nuiure^ i/tdicate the ttcnir tff fhc perSoa 
sigmng. and the^ Clipacily in which (fie pmrs^^n jiens. tfxn<h capacity a not ^bviinafrom fitutiitig this pa^mr): 



By: 




Printed Name; 



Form PCT^odel of eciuraU powor of atlomey (for several inccmaiionul upplicaUons) (July 



lavalCtr 1 W7. Poim PCTM2 



e0/20-d 9fTS £28 i0t7 ndd HOay3S3a agyOSNOdS T£:60 T002-92 



This sheet is not part of and does not count as a sheet of the international application. 
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FEE CALCULATION SHEET 
Annex to the Request 



CALCULATION OF PRESCRIBED FEES 

. TRANSMITTAL FEE 



2. SEARCH FEE 

International search to be carried out by 



appli 

3. INTERNATIONAL FEE 
Basic Fee 

The international application contains 

first 30 sheets 

2 X $9.00 



Applicant's or agent's 
file reference 



1462-PCT-OO 



For receiving Office use only 



PCT/us ul/06287 

International application No ^ 



Date stamp of the receiving Office 



Applicant 

Auburn University and University of Central Florida 



240.00 



700.00 



(If two or more International Searching Authorities are competent in relation to the international 
application, indicate the name of the Authority which is chosen to carry out the international search.) 



32 



sheets. 



bl 



additional amount 



18.00 



remaining sheets 

Add amounts entered at b 1 and b2 and enter total at B 
Designation Fees 

The international application contains ?Z — designations. 

6 X 82.00 



bl 



^Ol) -4^TO0f B 



number of designation fees 
payable (maximum 6) 



amount of designation fee 



492.00 



D 



Add amounts entered at B and D and enter total at I 

(Applicants from certain States are entitled to a reduction of 75% of the 
international fee. Where the applicant is (or all applicants are) so entitled, the 



-««:00 



4. FEE FOR PRIORITY DOCUMENT ("i^appZ/caWe; 

5. TOTAL FEES PAYABLE 

Add amounts entered at T, S, I and P, and enter total in the TOTAL box 



15.00 



TOTAL 



I I The designation fees are not paid at this time. 



MODE OF PAYMENT 

□ authorization to charge 
deposit account (see below) 



cheque 

□ 

postal money order 



I I bank draft 

I I cash 

I I revenue stamps 



I I coupons 

I I other (specify): 



/6 



DEPOSIT ACCOUNT AUTHORIZATION (this mode of payment may not be available at all receiving Offices) 
The RO/ US j j is hereby authorized to charge the total fees indicated above to my deposit account. 

Pvl (this check-box may be marked only if the conditions for deposit accounts of the receiving Office so permit) i; 
hereby authorized to charge any deficiency or credit aoy overpayment in the total fees mdicated above to m: 
deposit account. / 

□ is hereby authorized to charge the fee for preparation ay(d transmittal of ih^ pjiority document to the International 
Bureau of WIPO 



13-3405 



Deposit Account No. 



authorized to charge the fee for preparation and transmittal of the pjiority document to tne 
WIPO t© my deposit account. / I I \ L y/^A K/h ^ 

Date (d\y/monthJyear) Signature 



Form PCT/RO/101 (Annex) (January 2001) 



LegalStar 2001, Form PCTRFEE 
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0 



^0 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
7 September 2001 (07.09.2001) 




illlilllll^^ 



PCT 



(10) International Publication Number 

WO 01/64927 Al 
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EXPRESSION OF AN ANTIMICROBIAL PEPTIDE VIA THE 
PLASTID GENOME TO CONTROL 
PHYTOPATHOGENIC BACTERIA 

RELATED APPLICATIONS 
This patent application claims the benefit of U.S. Provisional Application No. 
60/1 85,662, filed 2/29/00. This application is here incorporated by reference. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 
The work of this invention is supported in part by the USDA-NRICGP grants 95-82770, 
97-35504 and 98-0185 to Henry Daniell. 



FIELD OF INVENTION 
This appUcation pertains to the field of genetic engineering of plant genomes, particularly 
15 plastids, and to methods of and engineered plants that express antimicrobial peptides that lead to 
and result in phytopathogenic bacteria resistance. 

DESCRIPTION OF RELATED ART 
Zasloff, in U.S. patent 5,643,876 and 4,810,777, entitled "Biologically Active Synthetic 
Magainin Peptides" and "Antimicrobial Compounds," described a family of synthetic 
compounds termed "magainin which are capable of inhibiting the growth or proliferation of 
gram-positive and gram negative bacteria, fungi, virus, and protozoan species. 

Haynie, in U. S. patent 5,847,047, entitled "Antimicrobial Composition of Polymer and a 
Peptide Forming Amphiphilic HeUces of the Magainin-Type," offers a series of non-natural 
oligopeptides that share a common amino acid sequence referred to as the core oUgopeptide. 
Such core oligopeptide has antimicrobial effects. The patent also provides N-addition analogues 
to the core oligopeptide that exhibit higher antimicrobial effects. 

Olsen et. al., in U. S. patent 6,143,498, entitled "Antimicrobial Peptide," proposed a 
method of producing human antimicrobial peptides from the defensin superfamily through 
transformation of host cells. Olsen suggested the production of these defensin-related peptides 
through transformation of host cells with vectors contaming the isolated DNA molecules of the 
peptides. 

Kim, et. al., in U. S. patent 6,183,992, entitled "Method For Mass Production Of 
Antimicrobial Peptide," offered a method of mass producing an antimicrobial peptide. In 
particular, a fusion gene - containing a basic antimicrovial peptide which ligated directly or 
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indirectly to a negatively charged acidic peptide having at least two cysteine residues - is cloned 
into an expression vector targeted toward microorganisms such as E. Coli. 

All patents and publications are hereby incorporated by reference in their entireties. 

BACKGROUND OF THE INVENTION 
Plant diseases caused by bacterial pathogens have had a detrimental effect on global crop 
production for years. Between 1979 and 1980 India lost up to 60% of its rice crop due to bacterial rice 
blight. Between 1988 and 1990, there was a 10.1% loss of the global barley crop due to bacterial 
pathogens, worth $1.9 biUion (Baker et al.. 1997). In the United States, there was an estimated 44,600 
metric ton reduction of soybean crops due to bacterial pathogens in 1994 (Wrath et al., 1996). On the 
average, pathogens are responsible for a 12-13% reduction of global crop production each year 
(Dempsey et al., 1998). 

A prior effort to combat these devastating pathogens is plant breeding (Mourgues et al., 1998). 
The results were Ihnited due to the ability of the bacteria to adapt and find a way around the defense 
mechanism. Agrochemicals have also been used but their apphcation is limited by their toxicity to 
humans and the environment (Mourgues et al, 1998). 

Plant Defense Against Pathogens: Many of the pathways and products in the plant response to 
phytopathogens have been elucidated with the emergence of molecular biology. The plant defense 
response can be divided into 3 major categories, early defense (fast), local defense (fast/intermediate) 
and systemic defense (intermediate to slow) (Mourgues et al., 1998). During the early stage, the plant 
cell is stimulated by contact with pathogen-produced elicitors. Bacterial genes such as hrp 
(hypersensitive response and pathogenicity) or avr (avirulence) genes stimulate the plant defense 
mechanism (Baker et al., 1997). The most prominent early defense response is the HR (hypersensitive 
response), which leads to cellular death reducing further infection by the pathogen. Local defense 
entails cell wall reinforcement, stimulation of secondary metabolite pathways, synthesis of thionins and 
synthesis of PR (pathogenesis-related) protems (Mourgues et al., 1998). The final phase is known as 
SAR (systemic acquired resistance), which protects the umnfected regions of the plant. 
Engineering Resistance: Genetic engineering has allowed for some enhancement of natural defense 
genes firom plants by clonmg and over-expression in non-host plants. Clonmg of resistance (R) genes 
has been used to protect rice firom bacterial leaf blight (Mourgues et al., 1998). Pathogenesis-related 
(PR) genes have been cloned fiom barley and have shown to provide resistance to P. syringae pv. 
(abaci (Mourgues et al., 1998). Anti-fiingal peptides produced by various organisms have been cloned 
and studied. However, although anti-fimgal development has been promising, bacteria still maintain 
35 the ability to adapt to plant defenses. 



20 



25 



30 



2 



10 



15 



20 



25 



30 



35 



PCT/USOl/06287 

Those skilled in the art will be familiar with antimicrobial peptides. Examples of some of these 
substances include PGLa (frog skin), defensins (human phagocytes), cecropins (Silkmoth pupae or pig 
intestine), apidaecins (honeybee lymph), melittin (bee venom), bombinin (toad skin) and the magainins 
(frog skin). Specifically bactericidal peptides include large polypeptides such as lysozyme (MW 15000 
daltons) and attacins (MW 20-23,000 daltons) as well as smaller polypeptides such as cecropin (MW 
4000 daltons) and the magainins (MW 2500 daltons), The spectrum of biocidal activity of these 
peptides is somewhat correlated to size. In general, the large polypeptides are active against limited 
types and species of microorganisms (e.g., lysozyme against only gram positive bacteria), whereas 
many of the smaUer oUgopeptides demonsfrate a broad spectrum of antimicrobial activity, killing many 
species of both gram positive and gram negative bacteria. It has been shown that magainin, cecropins, 
and bombinin oligopeptides form similar secondary structures described as an amphiphilic heUx 
(Kaiser et al. Annu. Rev. Biophys. Biophys. Chem 16, 561-581, 1987). These peptides with a-helical 
structures are ubiquitous and found in many organisms. They are beUeved to participate in the defense 
against potential microbial pathogens. One of the first biocidal oligopeptides to be isolated from 
natural sources was bombinin and is described by Csordas et al. (Proc. Int. Symp. Anim. Plant Toxins. 
2, 515-523, (1970)). Csordas teaches significant sequence homology between bombinin and melittin, 
another antimicrobial peptide, isolated from bee venom. 

SpecificaUy. the role of magainins from Xenopus laevis (African frog) and its analogues have 
been investigated by Zasloff et al. (WO 9004408) as pharmaceutical compositions such as a broad- 
spectrum topical agent, a systemic antibiotic; a wound-healing stimulant; and an anticancer agent 
(Jacob and Zasloff, 1994). Cuervo et al. (WO 9006129) describe the preparation of deletion analogues 
of magainin I and II for use as phannaceutical compositions. They disclose a general scheme for the 
synthetic preparation of compounds with magainin-like activity and structure. However, the possible 
agricultural use of magainin-type antimicrobial peptides has not yet been explored. Accordingly, it is 
an objective of this invention to demonstrate the conference of phytopathogenic bacteria resistant to 
plants by transforming plant cell plastids to express magainin and its analogues. 
Plastid Transformation: To date, plastid fransfomiation, particularly has enabled generation of 
herbicide (DanieU et al., 1998), insect resistant crops (Kota et al., 1999; McBride et al., 1995; DeCosa 
et al., 2000) and production of pharmaceutical proteins (Guda et al, 2000; Staub et al., 2000). Plastid 
transformation was selected because of several advantages over nuclear transforation (DanieU. 1999 
A, B; Bogorad, 2000; Heifetz, 2000). With concern growing about outcrossing of genetically altered 
genes, it should be noted that plastid expressed genes are maternally inherited in most crops. Gene 
containment is possible when foreign genes are engineered via the plastid genome, which prevents 
poUen transmission in crops that maternally inherit the plastid genome. Because a majority of crop 
plants inherit their plastid genes maternally, the foreign genes do not escape into the environment 
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Although pollen from plants that exhibit maternal mheritance contain metaboUcally active plastids, the 
plastid DNA is lost during pollen maturation (Helfetz. 2000). Despite the potential advantage of plJstid 
reproduction of AMPs. it was not obvious that AMPs would be produed in- this manner. Prior to the 
patent application there were no pubUshed reports of expression of AMPs in plant plastids. 
5 Non-obviousness of the disease resistance. Several foreign genes have been expressed within 
plastids to introduce novel traits including herbicide resistance or insect resistance. However, all 
of these foreign proteins, without exception, function within plastids. For example, herbicides 
target proteins or enzymes present within plastids. When engineered plastids are consumed by 
target insects, insecticidal proteins are released inside the insect gut. 
1 0 However, in order to use the chloroplast compartment to engineer disease resistance, it 

was necessary to export foreign proteins into the cytosol where phytopathogens colonize. 
Therefore, it wa.s not obvious to engineer the nlastid a e nome to confer disease re.qistanr.p Th^r. 
are no prior reports or suggestions in the liter a ture that plastid genome could be engineered tn 
confer disease re.S7.stance Also, it is known in the art that antimicrobial peptides are toxic to 
plant chloroplasts because of the charge on the chloroplast membranes. However, this mvention 
teaches that transgenic plastids expressmg antimicrobial peptides rupture at the site of infection 
upon cell death. Release of large amounts of the antimicrobial peptide prevent the spread of the 
phytopathogen. Thus, the present invention confmns a novel and unobvious solution to combat 
phytopathogens that is previously unknown and contrary to all current understanding of 
20 chloroplast biology. 

Most importantly, small peptides are not stable inside living cells and are highly 
susceptible to proteolytic degradation. For this reason, small peptides are usually produced as 
fusion protems with larger peptides in biological systems. Megainin type peptides are 
chemicaUy synthesized and never made in biological systems for that reason. Therefore, it was 
25 not obvious to express a small peptide of a few amino acids within plastids. Successful 

expression of this antimicrobial peptide was not anticipated but this invention opens the door for 
expression of several small peptides within plastids, including hormones. 
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SUMMARY OF THE INVENTION 
This invention provides a new option in the battie against phytopathogenic bacteria 
Ihrough transformation of the plant plastid genome. The present invention is appUcable to all 
plastids of plants. These include chromoplasts which are present in the fruits, vegetables and 
flowers; amyloplasts which are present m tubers like the potato; proplastids m roots; leucoplasts 
and etioplasts, both of which are present m non-green parts of plants. All known methods of 
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transformation can be used to introduce the vectors of this invention into target plant plastids 
including bombardment, PEG Treatment, Agrobacterium, microinjection, etc. 

This invention provides plastid expression constracts which are usefiil for genetic 
engineering of plant cells and which provide for enhanced expression of a foreign peptide in 
plant cell plastids. The transformed plant is preferably a metabolically active plastid, such as the 
plastids found in green plant tissues including leaves and cotyledons. The plastid is preferably 
one which is maintained at a high copy number in the plant tissue of interest 

The plastid expression constructs for use in this invention generally include a plastid 
promoter region and a DNA sequence of interest to be expressed in transformed plastids. The 
DNA sequence may contain one or a number of consecutive encoding regions, one of which 
preferably encoding an antimicrobial peptide of the magainin family. Plastid expression 
construct of this invention is linked to a construct haviag a DNA sequence encoding a selectable 
marker which can be expressed in a plant plastid. Expression of the selectable marker allows the 
identification of plant cells comprismg a plastid expressing the marker. 

In the preferred embodiment, transformation vectors for transfer of the construct into a 
plant cell include means for inserting the expression and stelection constructs into the plastid 
genome. This preferably comprises regions of homology to the target plastid genome which 
flank the constracts. 

The plastid vector or constructs of the invention preferably include a plastid expression 
vector which is capable of importing phytopathogenic bacteria resistance to a target plant species 
which comprises an expression cassette which is described further herein. Such a vector 
generally includes a plastid promoter region operative in said plant ceils' plastids, a DNA 
sequence which encode at least an antimicrobial peptide of the magainin family. Preferably, 
expression of one or more DNA sequences of interest will be in the transformed plastids. 

The preferred embodiment of the invention provides a universal plastid vector 
comprising a DNA construct. The DNA construct includes a 5' part of a plastid spacer 
sequence; a promoter, such as Prm, which is operative in the plastid of the target plant cells; a 
heterologous DNA sequence encoding at least one antimicrobial peptide of the magainin family; 
a gene that confers resistance to a selectable marker such as the aadA gene; a transcription 
termination region fimctional m the target plant cells; and flanking each side of the expression 
cassette, flanking DNA sequences which are homologous to a DNA sequence of the target 
plastid genome, whereby stable integration of the heterologous coding sequence into the plastid 
genome of the target plant is faciUtated through homologous recombination of the flanking 
sequence with the homologous sequences in the target plastid genome. The vector may further 
comprise a ribosome binding site (rbs), a 5' untranslated region (5'UTR). A promoter, such as 
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psbA, accD or 16srRNA, is to be used in conjunction with the 5'UTR. In addition to the 
encoding region of the anthnicrobial peptide, the heterologous DNA sequence of the DNA 
construct may also include other genes whose expression are desired. 

In another embodiment of the invention, non-universal plastid vectors such as pUC, 
5 pBlueScript, pGEM may be used as the agent to insert the DNA construct 

This invention provides transformed crops, like solanaceous, monocotyledonous and 
dicotyledonous plants, that are resistant to phytopathogenic bacteria. Preferably, the plants are 
edible for mammals, including humans. These plants express an antimicrobial peptide at levels 
high enough to provide upwards of 96% inhibition of growth against Pseudomonas syringae, a 
major plant pathogen. The transformed plants do not differ morphologically from untransformed 
plants. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. (A) Chloroplast vector used for transformation of Nicottana tabacum vai. Petit 
15 Havana. Vector contains the aadA selectable marker gene that confers resistance to 
spectinomycin, the Prm promoter, and the TpsbA terminator. (B) Amino acid sequence of the 
lytic peptide MSI-99. 

Figure 2. (A) Phenotype of Tq and Titransgenic plants. Plantsl-3 are To transgenic plants while 
plant 4 is untransformed. Plants 5-7 are T, transgenic plants. Seedlings germinated on 
20 MSO+500Mg/ml spectinomycin (B). Three Ti transgenic lines (1-3) and Control (4). 

Figure 3. (A) Primers, 8P and 8M used to confirm integration of foreign genes via PGR. 8P 
anneals with the 5'end of the aadA gene and 8M anneals with the 3'end of the 16S rDNA gene. 
PGR analysis of DNA extracted from To (B), T, (C) and T2 (D) plants run on a 0.8% agarose gel. 
To (B) Lane 1 Ikb ladder, 2 through 5 fransgenic lines, 6 MSI-99 plasmid. Ti (G) Lane 1, Ikb 
ladder, 2 through 4 transgenic, lane 5 plasmid control and lane 6 untransformed plant DNA. Tj 
(G) lane 1, Ikb ladder, 2 through 5 transgenic, lane 6 plasmid control and lane 7 untransformed 
plant DNA. 

Figure 4. Southem analysis of To and Ti generations. (A) Probe used to confirm integration of 
foreign genes. The 2.3kb probe fragment was cut with BamHI and NotI contaitiing the flanking 

30 sequence. (B) Lane 2-6 To transgenic lines, lane 1 untransformed and Lane 7 plasmid DNA. (C) 
Lanes 2-7 Ti transgenic lines. Lane 1 untransformed and Lane 8 plasmid DNA. 
Figure 5. In situ bioassays. 5 to 7mm areas of To transformants and untransformed Petit Havana 
leaves were scraped with fine grain sandpaper. Ten \A of 8xl0^ 8xl0^ 8x10^ and 8x10^ cells 
from an overnight culture of P. syringae were added to each prepared area. Photos were taken 5 

35 days after inoculation 
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Figure 6. In vitro bioassays for To, Tj and T2 generations of 3 transgenic Unes (lOA, llA and 
13 A). Five \x\ of bacterial cells from an ovemigjit culture were diluted to (Aeoo 0.1-0.3) and 
incubated for 2 hours at 25°C with 100^g of total plant protein extract. One ml of LB broth was 
added to each sample. Samples were incubated overnight at temperature appropriate for the 
specific bacteria. Absorbance at 600nm was recorded. Data was analyzed using GraphPad 
Prism. Negative control was untransformed plant extract. Buffer only was added as a control 
and stock culture was used as a reference point. 

Figure 7. In vitro bioassays for P. aeruginosa. Five |j.l of bacterial cells from an overnight 
culture were diluted to (Agoo 0. 1-0.3) and incubated for 2 hours at 25°C with 100|ag of total 
protein extract from T] plants. One ml of LB broth was added to each sample. Samples were 
incubated ovemight at 37''C. Absorbance at 600nm was recorded. Data was analyzed using 
GraphPad Prism. Negative control was an untransformed plant extract. Buffer only was added 
as a control and stock culture was used as a reference point. 

Figure 8. Five |xl of an overnight culture of P. syringae diluted to (Agoo 0.1-0.3) was mixed with 
15 100^g total protein extract from T2 lines 1 lA and 13A (germmated in the absence of 
spectinomycin). After 2-hour incubation, 1ml of LB broth was added to the mixture and 
incubated over night at 27^0. The following morning absorbance at 600„m was recorded (A). In 
parallel, 50jal of each mix was plated onto LB plates and incubated overnight at 27°C. The next 
morning a count of viable CPUs were made using the Bio Rad Gell Dock (B). 
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DETAILED DESCRIPTION OF THE INVENTION 
This mvention demonstrates the confering of phytopathogenic resistance in plants through 
plastid transformation. This invention includes the use of all plastids in plants, including chloroplasts. 
chloroplasts which are present in fruits, vegetables and flowers, amyloplasts which are present in 
25 tubers, proplastids in roots. lencoplasts in non-green parts of plants. In a preferred embodiment of the 
invention, the chloroplast genome is used. Plastid transformation and expression vectors comprising 
heterologous DNA encoding magainin and its analogues are provided . The anti-microbial peptide 
(AMP) used in this invention is an amphipathic alpha-helix molecule that has an affinity for negatively 
charged phospholipids commonly found m the outer-membrane of bacteria. Upon contact with these 
membranes, individual peptides aggregate to form pores in the membrane, resulting in bacterial lysis. 
Because of the concentration dependent action of the AMP, it was expressed via the plastid genome to 
accomplish high dose deUvery at the point of infection. PGR products and Southem blots confirmed 
plastid integration of the foreign genes and homoplasmy. Growth and development of the transgenic 
plants was unaffected by expression of the AMP within the plastids. In vitro assays with To, Ti and T2 
35 plants, confirmed the AMP was expressed at levels high enough to provide 86%(To), 88%(Ti) and 
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96%(T2) inhibition of growth against Pseudomonas syringae. a major plant pathogen. In situ assays 
resulted in intense areas of necrosis around the point of infection in control leaves, while transformed 
leaves showed no signs of necrosis. Even when germinated in the absence of spectinomycin selection, 
Tj generation plants showed 96% inhibition of growth against P.syringae. 
5 MSI-99 is an analogue of a naturally occurring peptide (magainin 2) found in the skin of the 

A&ican frog. Changes have been made to the amino acid sequence to enhance its lytic abilities. 
Contrary to the prior knowledge in the art which proposed that anti-microbial peptides having high 
antibacterial activity also have a high potential for toxic activity against the plastid (Everett and 
Nicholas, 1994), the transgenic plants of this invention grew, flowered and set seeds like the 
10 untransformed control. 

Key features of cationic peptides such as MSr-99 are a net positive charge, an affinity for 
negatively charged prokaryotic membrane phosphoUpids over neutral-charged eukaryotic membranes, 
and the abiUty to fonn aggregates that disrupt the bacterial membrane (Houston et al., 1997; Matsuzaki 
et al., 1999; Biggin and Sansom, 1999). Given the fact that the outer membrane is an essential and 
15 highly conserved part of all bacterial cells, it is highly unlikely that bacteria would be able to adapt (as 
they have against antibiotics) and to resist the lytic activity of these peptides. In contrast to prokaryotic 
membranes, the thylakoid membrane consists of primarily glycolipids and galactolipids instead of 
phospholipids. Monogalactosyldiacylglycerol (MGDG) makes up 50% of membrane lipid and 
digalactosyldiacylglycerol (DGDG) 30% (Siegenthaler et al., 1998). Both of these lipids are neutral. 
20 An object of this invention is to compartmentalize the expression of the MSI.99 within the 

plastid. Compartmentalization of lytic enzymes is a natural occurrence in plants. 
Compartmentalization serves two purposes: to increase the yield of the peptide and to deliver the 
peptide at the site of the infection. Due to the high copy number associated with plastid expression, a 
larger amount of the peptide is produced. The higher yield is important due to the concentration- 
25 dependent action of the anti-microbial peptide. Further, the peptide would be released at the site of 
infection during the HR response. When the HR response occurs, cells are lysed. This disrupts the 
osmotic balance and causes plastids to lyse. This would release the peptide at high concentration 
resulting m aggregation and formation of pores in the outer membrane of bacteria. This aids in the 
prevention of the spread of infection by bacteria. 
30 A high level of AMP expression can be expected due to the foUowmg reasons. The nature of 

plastids to move from a somatically unstable heteroplasmic state to a state of homoplasmy itself lends 
to high expression (Brock and Hagemann, 2000). The A+T % of MSI-99 is 51.39%, which is 
compatible with the Nicotiana tobacum plastid 61% A+T content (Bogorad et al., 1991; Shimada et al., 
1991). Also, published reports from our lab report expression of Cry2A operon (A+T content of 65%) 
35 at levels as high as 46% total soluble protein (DeCosa et al., 2000). 
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MSI-99 was most effective against P. syringae, evidenced by total inhibition of 1000 P. 
syringae cells with only ljig/1000 bacteria (Smith et al. unpublished data). Because the lytic activity of 
antimicrobial peptides is concentration dependent, the amount of antimicrobial peptide required to kill 
bacteria was used to estimate the level of expression in transgenic plants. Based on the minimum 
inhibitory concentration, it was estimated that transgenic plants expressed MSr-99 at 21 % of the total 
soluble protein. Without the availability of antibody for MSI-99, other dhect methods of protein 
estimation were not feasible. 
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Plastid vectors and plant transformation: The synthetic peptide used in this invention (MSI-99), is 
an analogue of the naturally occurring 23 amino acid peptide, magainin H. MSI-99 is a 22 amino acid 
sequence with an overall charge of +6 as shown in Figure 1. The gene cassette used for transformation 
consisted of the 16S rRNA promoter, the aadA gene, which confers resistance to spectinomycin, the 
MSI-99 gene and the psbA (photosynthetic binding protein) terminator. The gene construct may 
contain, in addition to the MSI-99 gene, another heterologous DNA sequence coding for a gene of 
15 interest. 

Flanking sequences are from the petunia plastid genome as shown m Figure lA. 
Transformation efficiency was much lower (7%) than that observed using the pLD vector (91%), which 
contains tobacco homologous flanking sequences. Other vectors that are capable of plastid 
transformation may be used to deliver the gene cassette into the plastid genome of the target plant cells. 

20 Such vectors do include plastid expression vectors such as pUC, pBlueScript, pGEM, and all others 
identified by Daniell in US patents number 5,693,507 and 5,932,479. These publications and patents 
are herein mcorporated by reference to the same extent as if each individual publication or patent was 
specifically and individually indicated to be incorporated by reference. The vectors preferably include 
a ribosome binding site (ibs) and a 5' untranslated region (5'UTR). A promoter operably in green or 

25 non-green plastids is to be used in conjunction with the 5 ' UTR) 

The number of transformants from the total number of shoots determined percent of 
transformants. Out of 55 spectinomycin resistant shoots screened, only 4 were transformants with the 
MSI-99 gene and the rest were mutants. All transformants grew healthy with no apparent 
morphological effects to To and Tj, generations as shown in Figure 2A. Tj, seeds germinated in the 

30 presence of spectinomycin produced healthy green seedlings, while contirol seedlings were bleached as 
shown in Figure 2B. 

Foreign gene integration, homoplasmy and copy number: PGR was performed by landing one 
primer on the 5'end of the aadA codmg sequence, not present in native plastid and the 3*end of the I6S 
rDNA (Figure 3A). PGR products of To, T,, and T2 generations yielded the same size product as the 
35 plasmid (MSI-99) as shown in Figure 3B,CJ) confirming integration of the foreign genes. The probe 
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used for the Southern analysis was a 2.3kb fragment Srom the 5'end of the tml (BamHI) to the 3'end of 
the 16SrDNA (Notl) (Figure 4A). The plant DNA was digested with BamHI. DNA from 
untransformed plants produced a 3.269kb fragment and transformed plant DNA produced a 4.65kb 
fragment. Southern analysis confirmed integration of foreign genes for To and Ti, as shown in Figure 
5 4B,C. Untransformed DNA showed a 3.2kb fragment while the transformed contoined a 4.65kb 
fragment. Presence of some wild type fragments in To transgenic samples mdicated some heteroplasmy 
as shown in Figure 4B. However, DNA from T,, generation produced only the 4.65Kb fragment 
confirming homoplasmy. As shown in Figure 4C. A cell is said to be homoplasmic when all of the 
plastid are uniformly transformed. If only a fraction of the genomes was transformed, the copy number 
10 should be less than 10,000 (Bendich, 1987). By confnming that the MSI-99 integrated genome is the 
only one present in transgenic plants (homoplasmy), one could estimate that the MSI-99 gene copy 
number could be as many as 10,000 per cell. 

Bioassays: Tq in situ assays in potted plants (6 to 7 months old) resulted in areas of necrosis 
surrounding the point of infection in untransformed control, while transgenic leaves showed no areas of 
15 necrosis (Figure 5). Even moculation of 8X10^ ceUs resulted in no necrosis in transgenic leaves (Figure 
5A), suggesting the local concentration of the antimicrobial peptide to be very high. However, 
untransformed plants inoculated with 8x10^ cells displayed intense necrosis as shown in Figure 5B. 

Cell free extracts of To, Tj, and T2 ttansgenic plants displayed a strong ability to mhibit growth 
of P. syringae in vitro by 84%, 86% and 96% compared to untransformed plants as shown m Figure 6. 
20 The mcrease in growth inhibition from Tq to T2 can be attributed to heteroplasmy in the To generation 
that was elunmated in subsequent generations. This indicates the peptides retamed their lytic activity 
and successfuUy passed on the trait to the subsequent generations. The conlrol had less growth than the 
buffer only. This is most probably due to natural defense peptides such as defensins and thionins 
produced by plants (Mourgues et al., 1998). When performing in vitro bioassays against />. aeruginosa, 
25 results were similar with Ti, generation showing 96% inhibition of growth (Figure 7). 

Absorbance readmgs as shown in Figure 8A from transgenic plants germmated in the absence 
of spectmomycin, displayed 96% mhibition of growth that is comparable to transgenic plants 
germmated in the presence of spectinomycin. Plated cells of bioassay samples from T2 plants 
germinated in the absence of spectinomycin as shown in Figure 8B showed 83% inhibition of growth 
compared to the control. The margmal degree of difference between the plating results and the 
bioassay results (13%) can be explamed by the difference in environment. While the plated bacteria 
were no longer exposed to active peptides, bacteria in the Uquid media were constantly surrounded by 
active peptides. 
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Protein Estimation: The plate with 10"^ dilution had 43 CPUs. The equated to 43x10^ CFU/ml. The 
count was adjusted to reflect the 5^1 of culture used. This resulted in a count of 21,500 bacterial cells 
in the initial 5^1 of culture incubated with the peptide. Using 1 to kill lOGO syringae ceUs as the 
reference (Smith et al. unpublished data), the estimated expression of MSI-99 was 21.5^g in lOOng 
soluble protein (21.5%). 

The initial low rate of transformation was most likely due to less than 100% homology between 
the petunia flanking sequences and the tobacco plastid genome. This is not surprising because very low 
transformation efficiency was also observed when tobacco plastid flanking sequences were used to 
transform potato plastid genome (Sidorov et al., 1999). Also, other projects in our lab that use the pLD 
vector (has tobacco flanking sequences) obtained transformation efficiency of 91% transfoimants to 
mutants. To and T] transgenic plants were healthy and showed no morphological or developmental 
abnormalities. Retention of lytic activity was evident in the sharp decrease in bacterial growth in the in 
vitro bioassays (84 to 96%). When comparing Southern blots to lytic activity, lytic activity increased 
as homoplasmy was reached. Equal lytic activity was also observed in transgenic plants germinated in 
the absence of spectinomycin (96% inhibition of growth). Transgenic plants transferred to potting soil 
for 5 to 6 months after being removed from spectinomycin selection, displayed similar antimicrobial 
properties against inoculations of P, syringae. These observations eliminate the possibility that 
spectinomycin absorbed into the plant tissue during germinationof seeds, may be responsible for the 
growth inhibition in the in vitro and in situ bioassays. Also, the observation that MSI-99 was equally 
active in transgenic plants germinated in the presence or absence of spectinomycin shows the stability 
of the introduced trait in the absence of any selection pressure. 

Plastid expression in crops such as tobacco should allow for mass production of the peptide at a 
lower cost compared to chemical synthesis or production in E. coli. This invention thus demonstrates 
another option m the on going battle agamst pathogenic bacteria. 

The mvention is exempUfied by the following non-limitmg example. 

Example 1 

Plant transformation: For plant transformation, Nicotiana tabacum var. Petit Havana seeds were 
germinated on MSO media at 27^C with photoperiods of 16 hour light and 8 hour dark. Sterile leaves 
were bombarded using the Bio-Rad Helium driven PDS-lOOO/He System. After bombardment, leaves 
were wrapped and kept in the dark for 48 hours. Leaves were then cut into Icm^ squares and placed on 
a petri dish containing RMOP media with 500^g/ml spectinomycin (first round of selection). Four to 
six weeks later, shoots were transferred to fresh media and antibiotic (second round of selection). 
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Shoots that appeared during the second selection were transferred to botties containing MSO and 
spectinomycin (SOOng/ml). Plants were screened "via PCR for transformation. Those that were PCR 
positive for the presence of the MSI-99 gene were transferred to pots and grown in chambers at li'C 
with photoperiods of 16-hour light and 8-hour dark. After flowering, seeds were harvested and 
5 sterilized with a solution of I-part bleach and 2-part water with 1 drop of tween-20. Seeds were 
vortexed for 5 minutes then washed 6 times with 500^1 of dHaO and dried in speed vac. T', and 
Tjseeds were germinated on MSO + 500ng/ml spectinomycin. Untransformed Petit Havana seeds were 
germinated on the same media as a control to ensure the spectinomycin was active. 
PCR conformation Plant DNA extraction on To, T,. and Tj was performed using the QIAGEN 
DNeasy Mini Kit on putative transgenic samples and untransfon-ned plants. PCR primers were 
designed using Primer Premier software and made by GIBCO BRL. Primer 
(SprS-ATCACCGCTTCCCTCATAAATCCCTCCCaO anneals with the 5' end of the aadA and primer 
(8M:5'CCACCTACAGACGCTTTACGCCCAATCA3') anneals with the 3' end of 16SrDNA as shown 
in Figure 3. PCR was carried out usmg the Gene Amp PCR system 2400 (Perldn-Ehner). Samples 
were run for 29 cycles with the following sequence: 94°C for 1 minute, 65°C for 1 minute and 72''C for 
3 minutes. The cycles were proceeded by a 94°C denaturation period and followed by a 72*C fmal 
extension period. A 4°C hold foUowed the cycles. PCR products were separated on agarose gels. 
Southern analysis: Integration of foreign genes for To and Tj, was determined by Southern blot 
analysis. DNA from transformed and untransformed plants was digested with BamHI and run on a 
20 0.7% agarose gel. The DNA was then transferred to a nylon membrane by capillary action. The probe 
was digested with BamHI and Notl and was labeled with 32 P using the Probe Quant™ G-50 Micro 
Columns and protocol (Amersham). Labeled probe was hybridized with the nylon membrane usmg the 
Stratagene QUICK-HYB hybridization solution and protocol. Membrane was exposed to fihn, and 
developed. 

25 In vitro bioassay: P. syringae and P.aeruginosa were cultured overnight prior to the assay. 50 mg of 
leaf tissue (minus mid-rib) was grounded in a micro-centrifuge containing ISOfxl of phosphate buffer 
pH5.5 with 5mM PMSF and 5mM with a plastic pestle. Samples were centrifuged for 5 mmutes at 
10,000x g at 4°C. Supernatant was transferred to a fresh tube and kept on ice. Protein concentration 
was determined by Bradford assay. One hundred fig of total plant protein was mixed with S^il of 
bacteria from overnight culture in a falcon tube. Initial absorbency ranged from 0.1 to 0.3 (A^oo). Tubes 
were incubated for 2 hours at 25''C on a rotary shaker at 125Tpm. One ml of LB broth was added and 
tubes were allowed to incubate for 18 hours at 27''C for P. syringae and 37°C for P. aeruginosa on a 
rotary shaker at 125rpm. Absorbance (Aaoo) was read for each tube. Results were statistically analyzed 
using GraphPad Prism. 
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To rule out spectinomycin as the cause of growth inhibition, the same experiment with P. 
syringae was repeated using T2 plants that were geminated on MSO with no spectinomycin. For 
confimiation of the absorption readings, a serial dilution was made of samples after the initial 2-hour 
incubation. Dilutions of 10"^ to 10"' were plated onto LB plates and incubated overnight at 27°C. The 
5 next morning a count of viable CPUs were made using the Bio Rad Gell Dock. 

To estimate the level of protein expression, a serial dilution was prepared from the starting 
bacterial culture (Absorbanceeoo, 0.1-0.3) used for the In vitro bioassay. Fifty nlof each dilution was 
plated on LB medium and incubated overnight at itc. The following morning, CPUs were couated 
using the Bio Rad Gel Dock and the amount of cells used in the bioassay was calculated. The 
10 minimum inhibitory concentration of Ing/1000 P.syringae cells was used to determine antimicrobial 
peptide concentration in lOOfigof cell free plant extracts. 

In situ bioassay: P. syringae was cultured overnight prior to the assay. Five to seven mm areas 
of To transformants and untransformed Petit Havana leaves were scraped with fine grain 
sandpaper. Ten ^il of 8xlO^ 8xlO^ 8XI0' and 8x10^ cells from an overnight culture of P. 
1 5 syringae were added to each prepared area. Photos were taken 5 days after inoculation. 
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"What is claimed is: 



1 . A stable plastid transformation and expression vector which comprises an 
expression cassette comprising, as operably linked components in the 5' to the 3' direction of 

5 translation, a promoter operative in said plastid, a selectable marker sequence, a heterologous 
DNA sequence coding for cytotoxic antimicrobial peptide (AMP), transcription termination 
functional in said plastid, and flanking each side of the expression cassette, flanking DNA 
sequences which axe homologous to a DNA sequence of die target plastid genome, whereby 
stable integration of the heterologous codmg sequence into the plastid genome of the target plant 
10 is facilitated through homologous recombination of the flanking sequence with the homologous 
sequences in the target plastid genome. 

2. A vector of claim 1 , wherem the plastid is selected from the group consisting of 
chloroplasts, chromoplasts, amyloplasts, proplastide, leucoplasts and etioplasts. 

3. A vector of clami 1, wherein the antimicrobial peptide is selected from the groups 
1 5 of defensins, PGLA (frog skin), cecropins, apidaecins, mehttin, bombinin and magainin. 

4. A vector of claim 3 , wherein the antimicrobial peptide is magainui I or 11. 

5. A vector of claim 1 , wherein the selectable marker sequence is an antibiotic-free 
selectable marker. 

6. A universal mtegration and expression vector of claim 1 competent for stably 
20 transfoiming a plastid genome of different plant species wherein the flanking DNA sequences 

are homologous to a spacer sequence of the target plastid genome and the sequence is conserved 
in the plastid genome of different plant species. 

7. A stably transformed plant which comprises plastid stably transformed with the 
vector of claims 1, 2, 3, 4, 5 or 6 or the progeny thereof, including seeds. 

^- A stably ti-^sformed plant of claim 7 which is a solanaceous plant. 

9. A stably transformed plant of claim 7 which is a monocotyledonous or 
dicotyledonous plant. 

10. A stably transformed plant of claun 9 which is maize, rice, grass, rye, barley, oat, 
wheat, soybean, peanut, grape, potato, sweet potato, pea. canola, tobacco, tomato or cotton. 

•^^ A stably transformed plant of claim 7 which is edible for mammals and humans. 

12. A stably tiransformed plant of claim 7 in which all the chloroplasts are uniformly 
transfonned. 

13. A stably ti-ansformed plant of claun 7 in which the tiansformed plastid of the 
plants including subsequent generations are capable of enhanced levels of expression. 
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14. A stably transformed plant of claim 7 in which transgenic plants germinated in the 
absence of antibiotic selectable marker sequence, like spectinomycin. 

15. A method for stably transforming a target plant to control a-phytopathogenic 
bacteria which comprises introducting an integration and expression vector of claims 1, 2, 3, 4, 5 

5 or 6 into a plastid genome of the target plant, and aUowing the transformed plant to grow. 

16. A vector of any one of claims 1 - 14, wherein the antimicrobial peptide is a 
cationic amphiphipathic alpha-helix molecule which has affmity for negatively charged 
phospholipides in the outer membrane of the target bacteria and which is functional to form 
aggregates that disrupt and lyse the bacterial membrane of the target microbe, and in the 

1 0 prevention of the spread of infection by the bacteria. 

17. A vector of any one of claims 1-14, wherein said vector further comprises a 
ribosome binding site (rbs) and a 5' untranslated region (5'UTR). 

18. A method of claim 1 5, wherein said vector further comprises a ribosome binding 
site (rbs) and a 5' unti-anslated region (5'UTR). 

15 
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1- A stable plastid transforation and expression vector which comprises an 
expression cassette comprising, as operably liniced components in the 5' to the 3' direction of 
translation, a promoter operative in said plastid. a selectable marker sequence, a heterologous 
DNA sequence coding for cytotoxic antimicrobial peptide (AMP), transcription termination 
functional in said plastid, and flanking each side of the expression cassette, flanking DNA 
sequences which are homologous to a DNA sequence of the target plastid genome, whereby 
stable mtegration of the heterologous coding sequence into the plastid genome of the target plant 
.s facilitated through homologous recombination of the flanking sequence with the homologous ' 
sequences in tiie target plastid genome. 

2. A vector of claim 1, wherein die vector further comprises a ribosome binding site 
(RBS) and a 5' untranslated region (5'UTR). 

3. A vector of claim 1. wherein the plastid is selected from tiie group consisting of 
chloroplasts. chromoplasts, amyloplasts. proplastide. leucoplasts and etioplasts. 

4. A vector of claim 1. wherein tiie antimicrobial peptide is a cationic 
amphiphipathic alpha-helix molecule which has affinity for negatively charged phosphoUpides 
located m an outer membrane of a target microbe and which is fimctional to fonn aggregates that 
disrupt and lyse the microbe membrane of the target microbe, and in preventing spreading of 
infection by the microbe. 

5. A vector of claim I, wherein the antimicrobial peptide is selected from the groups 
of defensins. PGLA (frog skin), cecropins. apidaecins. melittin. bombinin and magainins. 

6. A vector of claim 1. wherein the antimicrobial peptide is magainin I or an 
analogue thereof. 

7. A vector of claim 1. wherein the antimicrobial peptide is magainin II or an 
analogue tiiereof. 

8. The antimicrobial peptide of claim 7. wherein the analogue of magainin II is MSI- 

9. A vector of claim 1. wherein the selectable marker sequence is an antibiotic-free 
selectable marker. 

10. A vector of claim 1 which is a universal vector competent for stably transforming 
a plastid genome of different plant species wherein flanking DNA sequences are homologous to 
a spacer sequence of a target plastid genome and the sequences are conserved in the plastid 
genome of different plant species. 



-17- 



* 1462-pcT-oo SEP 2001 

11. A stably transformed plant which comprises plastid stably transformed with the 
vector of claim 1, and the progeny or subsequent generations thereof, including seeds. 

12. A stably transformed plant of claim 1 1 which is a solanaceous plant 

13. A stably transformed plant of claim 1 1 which is a monocotyledonous or 
5 dicotyledonous plant 

14. A stably transformed plant of claim 13 which is maize, rice, grass, rye, barley, 
oat, wheat, soybean, peanut, grape, potato, sweet potato, pea, canola, tobacco, tomato or cotton. 

1 5. A stably transformed plant of claun 1 1 which is edible for mammals and humans. 

1 6. A stably transformed plant of claun 1 1 in which all the plastids ai-e uniformly 
10 transformed. 

1 7. The stably transformed plant and progeny of claim 1 1 , wherein subsequent 
generations are capable of enhanced levels of expression of the cytotoxic antimicrobial peptide. 

18. A stably transfcmned plant of claim 1 1 in which transgenic plants germinated m 
the absence of antibiotic selectable marker sequence, like spectinomycin. 

• 5 1 9. A method for stably transforming a target plant to control a phytopathogenic 

bacteria which comprises transforming plastids of said target plant with a transformation and 
expression vector of claim 1, selecting transformed plant cells, and allowmg the transformed 
plant cells to grow. 

20. A method of claim 19, wherein said vector further comprises a ribosome bindmg 
20 site (RBS) and a 5 ' untranslated region (5 'UTR). 
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(57) Abstract: This inveniion provides a novel meihod to confer disease resistance to plants. Plant plastids arc translbrmed using 
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1 . A stable plastid transformation and expression vector which comprises an 
expression cassette comprising, as operably linked components in the 5' to the 3* direction of 
translation, a promoter operative in said plastid, a selectable marker sequence, a heterologous 
DNA sequence coding for cytotoxic antimicrobial peptide (AMP), transcription termination 
functional in said plastid, and flanking each side of die expression cassette, flanking DNA 
sequences which arc homologous to a DNA sequence of the target plastid genome, whereby 
stable integration of the heterologous coding sequence into the plastid genome of the target plant 
is facilitated through homologous recombination of the flanking sequence with the homologous 
sequences in the target plastid genome. 

2. A vector of claim 1, wherein the vector iurther comprises a ribosome binding site 
(RBS) and a 5' untranslated region (5'UTR). 

3. A vector of claim 1 , wherein the plastid is selected from the group consisting of 
chloropiasts, chromoplasts, amyloplasts, proplastide, leucoplasts and etioplasts. 

4. A vector of claim 1 , wherein the antimicrobial peptide is a cationic 
amphiphipathic alpha-helix molecule which has affinity for negatively charged phospholipides 
located in an outer membrane of a target microbe and which is functional to form aggregates that 
disrupt and lyse the microbe membrane of the target microbe, and in preventing spreading of 
infection by the microbe. 

5. A vector of claim 1, wherein the antimicrobial peptide is selected from the groups 
of defensins, PGLA (frog skin), cecropins, apidaecins, melittin. bombinin and magainins. 

6. A vector of claim 1, wherein the antimicrobial peptide is magainin 1 or an 
analogue thereof 

7. A vector of claim 1, wherein the antimicrobial peptide is magainin TI or an 
analogue thereof 

8. The antimicrobial peptide of claim 7, wherein the analogue of magainin II is MSI- 

99. 

9. A vector of claim 1, wherein the selectable marker sequence is an antibiotic-free 
selectable marker. 

10. A vector of claim I which is a universal vector competent for stably transforming 
a plastid genome of different plant species wherein flanking DNA sequences are homologous to 
a spacer sequence of a target plastid genome and the sequences are conserved in the plastid 
genome of different plant species. 



- 19 - 

AMENDED SHEET M 



wo 01/64927 



PCT/USOl/06287 



11. A stably transformed plant which comprises plastid slab] y transformed with the 
vector of claim 1 , and the progeny or subsequent generations thereof, including seeds. 

12. A stably transformed plant of claim 1 1 which is a solanaceous plant, 

13. A stably transformed plant of claim 1 1 which is a monocotyledonous or 
dicotyledonous pJanl. 

14. A stably transformed plant of claim 13 which is maize, rice, grass, rye, barley, 
oat, wheat, soybean, peanut, grape, potato, sweet potato, pea, canola, tobacco, tomato or cotton, 

15. A stably transformed plant of claim 1 1 which is edible for mammals and humans. 

16. A stably transformed plant of claim 1 1 in which all the plastids are uniformly 
transfomied. 

17. The stably transformed plant and progeny of claim 1 1 , wherein subsequent 
generations are capable of enhanced levels of expression of the cytotoxic antimicrobial peptide. 

IS, A stably transfonTied plant of claim I 1 in which transgenic plants germinated in 
the absence of antibiotic selectable marker sequence, like spectinomycin. 

19. A method for stably transforming a target plant to control a phytopatliogenic 
bacteria which comprises transforming plastids of said target plant with a transfonnation and 
expression vector of claim 1, selecting transformed plant cells, and allowing the transformed 
plant cells to grow- 

20. A method of claim 19, wherein said vector further comprises a ribosome binding 
site (RBS) and a 5' untranslated region (5'UTR), 
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RNA BINDING PROTEIN AND BINDING SITE 
USEFUL FOR EXPRESSION OF RECOMBINANT MOLECULES 

5 

Technical Fip][^ 

The invention relates to expression systems and methods for 
expression of desired genes and gene products in cells. 
Particularly, the invention relates to a gene encoding a RNA 
10 binding protein useful for regulating gene expression in cells, 
the protein binding site, a gene encoding a regulating protein 
disulfide isomerase and methods and systems for gene expression 
of recombinant molecules. 

15 BackaronnH 

Expression systems for expression of exogenous foreign 
genes in eukaryotic and prokaryotic cells are basic components 
of recombinant DNA technology. Despite the abundance of 
expression systems and their wide-spread use, they all have 
20 characteristic disadvantages. For example, while expression in 
E. coli is probably the most popular as it is easy to grow and 
is well understood, eukaryotic proteins expressed therein are 
not properly modified. Moreover, those proteins tend to 
precipitate into insoluble aggregates and are difficult to 
25 obtain in large amounts. Mammalian expression systems, while 

practical on small-scale protein production, are more difficult, 
time-consuming and expensive than in E. coli. 

A number of plant expression systems exist as well as 
summarized in US Patent No. 5,234,834, the disclosures of which 
3 0 are hereby incorporated by reference. One advantage of plants 
or algae in an expression system is that they can be used to 
produce pharmacologically important proteins and enzymes on a 
large scale and in relatively pure form. In addition, 
micro-algae have several unique characteristics that make them 
35 ideal organisms for the production of proteins on a large scale. 
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First, unlike most systems presently used to produce transgenic 
proteins, algae can be grown in minimal media (inorganic salts) 
using sunlight as the energy source. These algae can be grown 
in contained fermentation vessels or on large scale in monitored 
5 ponds. Ponds of up to several acres are routinely used for the 
production of micro-algae. Second, plants and algae have two 
distinct compartments, the cytoplasm and the chloroplast, in 
which proteins can be expressed. The cytoplasm of algae is 
similar to that of other eukaryotic organisms used for protein 

10 expression, like yeast and insect cell cultures. The 

chloroplast is unique to plants and algae and proteins expressed 
in this environment are likely to have properties different from 
those of cytoplasmically expressed proteins. 

The present invention describes an expression system in 

15 which exogenous molecules are readily expressed in either 

prokaryotic or eukaryotic hosts and in either the cytoplasm or 
chloroplast. These beneficial attributes are based on the 
discovery and cloning of components of translation regulation in 
plants as described in the present invention. 

20 Protein translation plays a key role in the regulation of 

gene expression across the spectrum of organisms (Kozak, Ann. 
Rev. Cell Biol. . 8:197-225 (1992) and de Smit and Van Duin, 
Prog. Nucleic Ac id Res. Mol . Biol. . 38:1-35 (1990)). The 
majority of regulatory schemes characterized to date involve 

25 translational repression often involving proteins binding to 
mRNA to limit ribosome association (Winter et al . , Proc. Natl . 
Acad. Sci . . u5A . 84:7822-7826 (1987) and Tang and Draper, 
BiQchem. , 29:4434-4439 (1990)). Translational activation has 
also been observed (Wulczyn and Kahmann, Cell . 65:259-26 9 

30 (1991)), but few of the underlying molecular mechanisms for this 
type of regulation have been identified. In plants, light 
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activates the expression of many genes. Light has been shown to 
activate expression of specific chloroplast encoded mRNAs by 
increasing translation initiation (Mayfield et al . , Ann. Rf^v , 
PUnt Physiol. Plant Mol . Rinl , 46:147-166 (1995) and Yohn et 
5 al., Mol. Cgll Biol 16:3560-3566 (1996)). Genetic evidence in 
higher plants and algae has shown that nuclear encoded factors 
are required for translational activation of specific 
chloroplast encoded mRNAs (Rochaix et al . , Embo .T . 8:1013-1021 
(1989), Kuchka et al . , £eil, 58:869-876 (1989), Girard-Bascou et 

0 al., Embo J. , 13:3170-3181 (1994), Kim et al.. Plant Mol 
127:1537-1545 (1994) . 

In the green algae Chlamydomonas reinhardtli, a number of 
nuclear mutants have been identified that affect translation of 
single specific mRNAs in the chloroplast, often acting at 

1 translation initiation (Yohn et al . , supra . (1996)). Mutational 
analysis of chloroplast mRNAs has identified sequence elements 
within the 5' untranslated region (UTR) of mRNAs that are 
required for translational activation (Mayfield et al . , supra . 
(1995), Mayfield et al . , J. Cell Rinl 127:1537-1545 (1994) and 

' Rochaix, Ann. Rev. Cell Biol . , 8:1-28 (1992)), and the 5- UTR of 
a chloroplast mRNA can confer a specific translation phenotype 
on a reporter gene in vivo (Zerges and Rochaix, Mol. Cfill Pt-inl 
14:5268-5277 (1994) and Staub and Maliga, Embo J. . 12:601-606 
(1993) . 

Putative translational activator proteins were identified 
by purifying a complex of four proteins that binds with high 
affinity and specificity to the 5- UTR of the chloroplast 
encoded psbA mRNA [encoding the Dl protein, a major component of 
Photosystem II (PS 11)] (Danon and Mayfield, Embo J. . 10:3993- 
4001 (1991)). Binding of these proteins to the 5' UTR of psjbA 
mRNA correlates with translation of this mRNA under a variety of 
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physiological (Danon and Mayfield, M^, (1991)) and biochemical 
conditions (Danon and Mayfield, Science, 266:1717-1719 (1994) 
and Danon and Mayfield, Embo .T. . 13:2227-2235 (1994)), and in 
different genetic backgrounds (Yohn et al . , supra , (1996)). The 
5 binding of this complex to the psbA mRNA can be regulated in 
vitro in response to both redox potential (Danon and Mayfield, 
Sslsnce. 266:1717-1719 (1994)) and phosphorylation (Danon and 
Mayfield, Embo J. , 13:2227-2235 (1994)), both of which are 
thought to transduce the light signal to activate translation of 
0 psbA mRNA. The 47 kDa member of the psbA RNA binding complex 
(RB47) is in close contact with the RNA, and antisera specific 
to this protein inhibits binding to the psjbA mRNA in vitro 
(Danon and Mayfield, supra . (1991) ) . 

Although the translational control of psbA mRNA by RB47 has 
5 been reported, the protein has not been extensively 
characterized and the gene encoding RB4 7 has not been 
identified, cloned and sequenced. In addition, the regulatory 
control of the activation of RNA binding activity to the binding 
site by nuclear- encoded trans-acting factors, such as RB60, have 
) not been fully understood. The present invention now describes 
the cloning and sequencing of both RB47 and RB60. Based on the 
translation regulation mechanisms of RB47 and RB60 with the RB47 
binding site, the present invention also describes a translation 
regulated expression system for use in both prokaryotes and 
' eukaryotes . 

Brief DeHr-ri ption of the Invention 

The RB47 gene encoding the RB47 activator protein has now 
been cloned and sequenced, and the target binding site for RB47 
on messenger RNA (mRNA) has now been identified. In addition, a 
regulatory protein disulfide isomerase, a 60 kilodalton protein 
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referred to as RB60, has also been cloned, sequenced and 
characterized. Thus, the present invention is directed to gene 
expression systems in eukaryotic and prokaryotic cells based on 
translational regulation by RB47 protein, its binding site and 
the RB60 regulation of RB47 binding site activation. 

More particularly, the present invention describes the use 
of the RB47 binding site, i.e., a 5- untranslated region (UTR) 
of the chloroplast psbA gene, in the context of an expression 
system for regulating the expression of genes encoding a desired 
recombinant molecule. Protein translation is effected by the 
combination of the RB47 binding site and the RB47 binding 
protein in the presence of protein translation components. 
Regulation can be further imposed with the use of the RB60 
regulatory protein disulfide isomerase. Therefore, the present 
invention describes reagents and expression cassettes for 
controlling gene expression by affecting translation of a coding 
nucleic acid sequence in a cell expression system. 

Thus, in one embodiment, the invention contemplates a RB47 
binding site sequence, i.e., a mRNA sequence, typically a mRNA 
leader sequence, which contains the RB47 binding site. A 
preferred RB47 binding site is psbA mRNA. For use in expressing 
recombinant molecules, the RB47 binding site is typically 
inserted 5- to the coding region of the preselected molecule to 
be expressed. In a preferred embodiment, the RB47 binding site 
is inserted into the 5- untranslated region along with an 
upstream psbA promoter to drive the expression of a preselected 
nucleic acid encoding a desired molecule. in alternative 
embodiments, the RB47 binding site is inserted into the 
regulatory region downstream of any suitable promoter present in 
a eukaryotic or prokaryotic expression vector. Preferably, the 
RB47 binding site is positioned within 100 nucleotides of the 
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translation initiation site. In a further aspect, 3' to the 
coding region is a 3 * untranslated region (3' UTR) necessary for 
transcription termination and RNA processing. 

Thus, in a preferred embodiment, the invention contemplates 
5 an expression cassette or vector that contains a transcription 
unit constructed for expression of a preselected nucleic acid or 
gene such that upon transcription, the resulting mRNA contains 
the RB47 binding site for regulation of the translation of the 
preselected gene transcript through the binding of the 

10 activating RB47 protein. The RB47 protein is provided 

endogenously in a recipient cell and/or is a recombinant protein 
expressed in that cell. 

Thus, the invention also contemplates a nucleic acid 
molecule containing the sequence of the RB47 gene. The nucleic 

15 acid molecule is preferably in an expression vector capable of 
expressing the gene in a cell for use in interacting with a RB47 
binding site. The invention therefore contemplates an expressed 
recombinant RB47 protein. In one embodiment, the RB4 7 binding 
site and RB47 encoding nucleotide sequences are provided on the 

20 same genetic element. In alternative embodiments, the RB47 

binding site and RB47 encoding nucleotide sequences are provided 
separately. 

The invention further contemplates a nucleic acid molecule 
containing the sequence encoding the 69 kilodalton precursor to 

25 RB47. In alternative embodiments, the RB47 nucleic acid 
sequence contains a sequence of nucleotides to encode a 
histidine tag. Thus, the invention relates to the use of 
recombinant RB47, precursor RB47, and histidine-modif ied RB47 
for use in enhancing translation of a desired nucleic acid. 

30 The invention further contemplates a nucleic acid molecule 

containing a nucleotide sequence of a polypeptide which 
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regulates the binding of RB47 to RB47 binding site. A preferred 
regulatory molecule is the protein disulfide isomerase RB60. 
The RB6 0 -encoding nucleic acid molecule is preferably in an 
expression vector capable of expressing the gene in a cell for 
use in regulating the interaction of RB47 with a RB47 binding 
site. Thus, the invention also contemplates an expressed 
recombinant RB60 protein. In one embodiment, the RB4 7 binding 
site, RB4 7 encoding and RB60 encoding nucleotide sequences are 
provided on the same genetic element. In alternative 
embodiments, the expression control nucleotide sequences are 
provided separately. In a further aspect, the RB60 gene and 
RB47 binding site sequence are provided on the same construct. 

The invention can therefore be a cell culture system, an in 
vitro expression system or a whole tissue, preferably a plant, 
in which the transcription unit is present that contains the 
RB4 7 binding site and further includes a (1) transcription unit 
capable of expressing RB4 7 protein or (2) the endogenous RB47 
protein itself for the purpose of enhancing translation of the 
preselected gene having an RB47 binding site in the mRNA. 
Preferred cell culture systems are eukaryotic and prokaryotic 
cells. Particularly preferred cell culture systems include 
plants and more preferably algae. 

A further preferred embodiment includes (1) a separate 
transcription unit capable of expressing a regulatory molecule, 
preferably RB60 protein, or (2) the endogenous RB60 protein 
itself for the purpose of regulating translation of the 
preselected gene having an RB47 binding site in the mRNA. In an 
alternative preferred embodiment, one transcription unit is 
capable of expressing both the RB47 and RB60 proteins. In a 
further aspect, the RB4 7 binding site sequence and RB60 sequence 
are provided on the same construct . 
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In one aspect of the present invention, plant cells 
endogenously containing RB4 7 and RB60 proteins are used for the 
expression of recombinant molecules, such as proteins or 
polypeptides, through activation of the RB47 binding in an 
exogenously supplied expression cassette. Alternatively, stable 
plant cell lines containing endogenous RB4 7 and RB60 are first 
generated in which RB4 7 and/or RB60 proteins are overexpressed. 
Overexpression is obtained preferably through the stable 
transformation of the plant cell with one or more expression 
cassettes for encoding recombinant RB47 and RB60. In a further 
embodiment, stable cell lines, such as mammalian or bacterial 
cell lines, lacking endogenous RB47 and/or RB60 proteins are 
created that express exogenous RB47 and/or RB60, 

Plants for use with the present invention can be a 
transgenic plant, or a plant in which the genetic elements of 
the invention have been introduced. Based on the property of 
controlled translation provided by the combined use of the RB47 
protein and the RB47 binding site, translation can be regulated 
for any gene product, and the system can be introduced into any 
plant species. Similarly, the invention is useful for any 
prokaryotic or eukaryotic cell system. 

Methods for the preparation of expression vectors is well 
known in the recombinant DNA arts, and for expression in plants 
is well known in the transgenic plant arts. These particulars 
are not essential to the practice of the invention, and 
therefore will not be considered as limiting. 

The invention allows for high level of protein synthesis in 
plant chloroplasts and in the cytoplasm of both prokaryotic and 
eukaryotic cells. Because the chloroplast is such a productive 
plant organ, synthesis in chloroplasts is a preferred site of 
translation by virtue of the large amounts of protein that can 
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be produced. This aspect provides for great advantages in 
agricultural production of mass quantities of a preselected 
protein product. 

The invention further provides for the ability to screen 
for agonists or antagonists of the binding of RB47 to the RB47 
binding site using the expression systems as described herein. 
Antagonists of the binding are useful in the prevention of plant 
propagation. 

Also contemplated by the present invention is a screening 
assay for agonists or antagonists of RB6 0 in a manner analogous 
to that described above for RB47. Such agonists or antagonists 
would be useful in general to modify expression of RB60 as a way 
to regulate cellular processes in a redox manner. 

Kits containing expression cassettes and expression 
systems, along with packaging materials comprising a label with 
instructions for use, as described in the claimed embodiments 
are also contemplated for use in practicing the methods of this 
invention . 

Other uses will be apparent to one skilled in the art in 
light of the present disclosures. 

Byieg p^scriptioA ot Drawings 

In the figures forming a portion of this disclosure: 
Figures lA-lD show the complete protein amino acid residue 
sequence of RB47 is shown from residues 1-623, together with the 
corresponding nucleic acid sequence encoding the RB4 7 sequence, 
from base 1 to base 2732. The nucleotide coding region is shown 
from base 197-2065, the precursor form. The mature form is from 
nucleotide position 197-1402. Also shown is the mRNA leader, 
bases 1-196, and poly A tail of the mRNA, bases 2066-2732. Both 
the nucleotide and amino acid sequence are listed in SEQ ID NO 
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5. 



Figures 2A-2B show the complete protein amino acid residue 
sequence of RB60 is shown from residues 1-488, together with the 
corresponding nucleic acid sequence from base 1 to base 2413, of 
5 which bases 16-1614 encode the RB60 sequence. Both the 

nucleotide and amino acid sequence are listed in SEQ ID NO 10. 

Figures 3A-3C show the complete sequence of the psbA mRNA. 
showing both encoded psbA protein amino acid residue sequence 
(residues 1-352) and the nucleic acid sequence as further 
10 described in Example 3 is illustrated. Both the nucleotide and 
amino acid sequence are listed in SEQ ID NO 13. 

Figure 4 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 ' to 3 ' , a promoter 
region derived from the psJbA gene for encoding the Dl protein 
15 from C. reinhardtii further containing a transcription 

initiation site (TS) , the RB47 binding site, a region for 
insertion of a foreign or heterologous coding region, a'RB47 
coding region, a RB60 coding region, and the 3 • flanking region 
containing transcription termination site (TS) , flanked by an 
20 origin of replication and selection marker. Restriction 

endonuclease sites for facilitating insertion of the independent 
genetic elements are indicated and further described in Example 



4A. 



Figures 5A-5B show the nucleotide and amino acid sequence 
25 of the RB47 molecule containing a histidine tag, the sequences 
of which are also listed in SEQ ID NO 14. 

Figure 6 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 • to 3 ■ , a promoter 
region derived from the psbA gene for encoding the Dl protein 
3 0 from C. reinhardtii further containing a transcription 

initiation site (TS) , the RB47 binding site, a region for 

RECTIFIED SHEET (RULE 91) 
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insertion of a foreign or heterologous coding region, a RB47 
coding region, and the 3- flanking region containing 
transcription termination site (TS) . Restriction endonuclease 
sites for facilitating insertion of the independent genetic 
elements are indicated and further described in Example 4E. 

Figure 7 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 ' to 3 • , a promoter 
region derived from the psi^A gene for encoding the Dl protein 
from C. reinhardtii further containing a transcription 
initiation site (TS) , the RB47 binding site, a region for 
insertion of a foreign or heterologous coding region, and the 3- 
flanking region containing transcription termination site (TS) . 
Restriction endonuclease sites for facilitating insertion of the 
independent genetic elements are indicated and further described 
in Example 4G. 

Figure 8 is a Western blot of a tetanus toxin single chain 
antibody expressed with a construct of the present invention as 
further described in Example 4G1) . 

Figure 9 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 • to 3 ■ , a promoter 
region derived from the psJbA gene for encoding the Dl protein 
from C. reinhardtii further containing a transcription 
initiation site (TS) , the RB47 binding site, a region for 
insertion of a coding sequence of bacterial luciferase A and B 
proteins including the translation termination codon TAA. The 
3' flanking region containing transcription termination site 
(TS) . Restriction endonuclease sites for facilitating insertion 
of the independent genetic elements are indicated and further 
described in Example 4G2) . 

Figure 10 illustrates the accumulation of expressed 
bacterial luciferase protein in the chloroplast as further 
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10 



described in Example 4G2) . 

Figure 11 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 ' to 3 • , a promoter 
region derived from the psjbA gene for encoding the Dl protein 
5 from C. reinhardtii further containing a transcription 

initiation site (TS) , the RB47 binding site, a region for 
insertion of a foreign or heterologous coding region for dimeric 
IgA (dIgA) and the 3' flanking region containing transcription 
termination site (TS) . Restriction endonuclease sites for 
facilitating insertion of the independent genetic elements are 
indicated and further described in Example 4G3) . 



Detailed Description of thf- invent Ion 
15 A, Definitions 

TABLE OF CORRESPONDENCE 



20 



25 



30 



Code 


Group 


Nucleoti f 


A 


A 


adenine 


C 


C 


cytosine 


G 


G 


guanine 


T 


T 


thymine (in DNA) 


U 


U 


uracil (in RNA) 


Y 


C or T{U) 


pyrimidine 


R 


A or G 


purine 


M 


A or C 


amino 


K 


G or T(U) 


keto 


S 


G or C 


strong interaction (3 hydrogen bonds) 


W 


A or T(U) 


weak interaction (2 hydrogen bonds) 


H 


A or C or T(U) 


not-G 


B 


G or T{U) or C 


not -A 
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V G or C or A not-T or not-U 

D G or A or T(U) not-C 
N G, A, C or T(U) any 

l^ixxQ Ariel R^si^aye : An amino acid formed upon chemical 
digestion (hydrolysis) of a polypeptide at its peptide 
linkages. The amino acid residues described herein are 
preferably in the "L" isomeric form. However, residues 
in the "D" isomeric form can be substituted for any L- 
amino acid residue, as long as the desired functional 
property is retained by the polypeptide. NH^ refers to 
the free amino group present at the amino terminus of a 
polypeptide. COOH refers to the free carboxy group 
present at the carboxy terminus of a polypeptide. In 
keeping with standard polypeptide nomenclature 
(described in J. Biol rh^j^ , 243:3552-59 (1969) and 
adopted at 37 CFR §1 . 822 (b) (2 ) ) , abbreviations for amino 
acid residues are shown in the following Table of 
Correspondence : 



TABLE OF CORRESPONDENCE 
— SYMBOL AMTMn arrn 


1-Letter 


3-Letter- 




Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


He 


isoleucine 


L 


Leu 


leucine 
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5 



10 



20 



T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


Z 


Glx 


Glu and/or Gin 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


. X 


Xaa 


unknown / o t he r 


ion the 


following have the meanings below: 


BOC 


tert - butyl oxy car bony 1 




DCCI 


di cyl cohexylcarbodi imide 


DMF 


dimethylformamide 




OMe 


methoxy 




HOBt 


l-hydroxybezotriazole 





It should be noted that all amino acid residue 
sequences are represented herein by formulae whose left 

25 and right orientation is in the conventional direction 
of amino- terminus to carboxy- terminus . Furthermore, it 
should be noted that a dash at the beginning or end of 
an amino acid residue sequence indicates a peptide bond 
to a further sequence of one or more amino acid 

30 residues. 

PQlypgptricjfi : A linear series of amino acid 
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residues connected to one another by peptide bonds 
between the alpha-amino group and carboxy group of 
contiguous amino acid residues. 

Peptide : A linear series of no more than about 50 
amino acid residues connected one to the other as in a 
polypeptide. 

PJTQtein : A linear series of greater than 50 amino 
acid residues connected one to the other as in a 
polypeptide . 

Synthetic peptide : A chemically produced chain of 
amino acid residues linked together by peptide bonds 
that is free of naturally occurring proteins and 
fragments thereof. 

Nucleotide: A monomeric unit of DNA or RNA 
consisting of a sugar moiety (pentose), a phosphate, and 
a nitrogenous heterocyclic base. The base is linked to 
the sugar moiety via the glycosidic carbon (l- carbon of 
the pentose) and that combination of base and sugar is a 
nucleoside. When the nucleoside contains a phosphate 
group bonded to the 3 ' or 5 • position of the pentose it 
is referred to as a nucleotide. A sequence of 
operatively linked nucleotides is typically referred to 
herein as a "base sequence" or "nucleotide sequence", 
and their grammatical equivalents, and is represented 
herein by a sequence whose left to right orientation is 
in the conventional direction of 5 '-terminus to 3 ' - 
terminus . 

Base Pair (bp) : A partnership of adenine (A) with 
thymine (T) , or of cytosine (C) with guanine (G) in a 
double stranded DNA molecule. in RNA, uracil (U) is 
substituted for thymine. 
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Nucleic Acid: A polymer of nucleotides, either 
single or double stranded. 

PolYnucleot-ide: A polymer of single or double 
stranded nucleotides. As used herein "polynucleotide" 
and its grammatical equivalents will include the full 
range of nucleic acids. A polynucleotide will typically 
refer to a nucleic acid molecule comprised of a linear 
strand of two or more deoxyribonucleotides and/or 
ribonucleotides. The exact size will depend on many 
factors, which in turn depends on the ultimate 
conditions of use, as is well known in the art. The 
polynucleotides of the present invention include 
primers, probes, RNA/DNA segments, oligonucleotides or 
"oligos" (relatively short polynucleotides), genes, 
vectors, plasmids, and the like. 

Gene: A nucleic acid whose nucleotide sequence 
codes for an RNA or polypeptide. A gene can be either 
RNA or DNA. 

Duplex DIJA: A double- stranded nucleic acid 
molecule comprising two strands of substantially 
complementary polynucleotides held together by one or 
more hydrogen bonds between each of the complementary 
bases present in a base pair of the duplex. Because the 
nucleotides that form a base pair can be either a 
ribonucleotide base or a deoxyribonucleotide base, the 
phrase "duplex DNA" refers to either a DNA-DNA duplex 
comprising two DNA strands (ds DNA) , or an RNA-DNA 
duplex comprising one DNA and one RNA strand. 

Complementary Baap^fg- Nucleotides that normally 
pair up when DNA or RNA adopts a double stranded 
configuration . 
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Completnentarv Nucleotide Seauencf^ : A sequence of 
nucleotides in a single -stranded molecule of DNA or RNA 
that is sufficiently complementary to that on another 
single strand to specifically hybridize to it with 
consecjuent hydrogen bonding. 

Recombinant DNA (rPNA) mnlRnnl^- a DNA molecule 
produced by operatively linking two DNA segments. Thus, 
a recombinant DNA molecule is a hybrid DNA molecule 
comprising at least two nucleotide sequences not 
normally found together in nature. rDNA's not having a 
common biological origin, i.e., evolutionarily 
different, are said to be "heterologous". 

VectQ^r: A rDNA molecule capable of autonomous 
replication in a cell and to which a DNA segment, e.g., 
gene or polynucleotide, can be operatively linked so as 
to bring about replication of the attached segment. 
Vectors capable of directing the expression of genes 
encoding for one or more polypeptides are referred to 
herein as "expression vectors" . 

Conserve^ : A nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
randomly hybridizes to an exact complement of the 
preselected sequence. 

HyIpri<jiza1;iQn: The pairing of substantially 
complementary nucleotide sequences (strands of nucleic 
acid) to form a duplex or heteroduplex by the 
establishment of hydrogen bonds between complementary 
base pairs. It is a specific, i.e., non-random, 
interaction between two complementary polynucleotides 
that can be competitively inhibited. 

Nucleotide Analog: A purine or pyrimidine 
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nucleotide that differs structurally from A, G, or 

but is sufficiently similar to substitute for the 
normal nucleotide in a nucleic acid molecule. 

Upfftr^am: In the direction opposite to the 
5 direction of DNA transcription, and therefore going from 
5' to 3 ■ on the noncoding strand, or 3 ' to 5 » on the RNA 
transcript . 

Dovmstreain : Further along a DNA sequence in the 
direction of sequence transcription or read out, that 
10 is, traveling in a 3 to 5 ' -direction along the 

noncoding strand of the DNA or 5 • - to 3 » -direction along 
the RNA transcript. 

Stop Codon: Any of three codons that do not code 
for an amino acid, but instead cause termination of 
15 protein synthesis. They are UAG, UAA and UGA and are 
also referred to as a nonsense, termination, or 
translational stop codon. 

Reaching Fy^me : Particular sequence of contiguous 
nucleotide triplets (codons) employed in translation. 

2 0 The reading frame depends on the location of the 

translation initiation codon. 

Homolog: Refers to a molecules that is 
structurally or functionally equivalent to a molecule of 
the present invention. 
25 Fusion Protein : A polypeptide produced by 

recombinant DNA methods in which a first polypeptide 
domain is operatively linked to a second polypeptide 
domain by the peptide bond produced through expression 
of a single open reading frame to express a single 

3 0 "fused" polypeptide. 

Chimeric Molequle: A bifunctional molecule formed 
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by connecting two separate molecules through chemical 
linkage, such as by crosslinking two isolated 
polypeptides or joining two heterologous fragments of 
DNA from different sources . 

5 

B. Tr^nglatiQA^l R^g^l^tion by RB47 ^nd RB6Q 
1. RB47 ^nd Rg47 Binding Site 

The present invention is based primarily on 
the discovery of that RB47, a mRNA binding protein, is a 

10 translational activator of the chloroplast psbA mRNA 

from Chlamydomonas reinhardtii , a green algae. The role 
of RB47 is now clearer, as evidence for its function 
comes from several independent sources. First, 
biochemical analysis has shown that this protein (along 

15 with other proteins within the complex) binds with high 
affinity and specificity to the psJbA 5' UTR in a manner 
consistent with a role in translational regulation; high 
levels of binding are observed during high translation 
levels in the light, and low levels of binding are 

20 observed when translation is low in the dark. Second, 

the predicted amino acid sequence of RB47, now available 
since the cloning of RB47 as described herein, indicates 
the role this protein plays in translation. RB47 
belongs to a family of proteins known as poly (A) binding 

25 proteins that bind RNA and have been shown to play a 
role in translation initiation (Bag and Wu, Eur . J . 
Biochem. . 237:143-152 (1996); de Melo Neto et al . , Nuc . 
Acids Res . . 23:2198-2205 (1995); Proweller and Butler, 
J. Biol. Chem. , 271:10859-10865 (1996); Sachs and Davis, 

30 Cell . 58:857-867 (1989); Tarun and Sachs, Embo J. . 

15:7168-7177 (1996)). Finally, genetic analysis has 
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predicted translational activators of chloroplast mRNAs 
(Girard-Bascou et al . , Curr. G^nRt ^ ^ 22:47-52 (1992); 
Kuchka et al . , Embo J. , 7:319-324 (1988); Rochaix et 
al-/ Ann. Rev, Q^H BjQ} ,, 8:1-28 (1989); and Yohn et 
al./ Mol> CeU. BjQl,, 16:3560-3566 (1996)). With the 
use and characterization of the hfl4 9 mutant, a high 
fluorescence phenotype of C. reinhardtii, the absence of 
RB47 has been shown to correspond directly to the loss 
of translational initiation of the psbA mRNA, thus 
defining RB47 as a translational activator of the psJbA 
mRNA. This is further supported by an additional 
nuclear mutation in C. reinhardtii (hf261) which is 
non-allelic to hfl49, but shows the similar phenotype of 
a specific loss of Dl translation. The RB47 protein 
accumulates to less than 10% of the wild type level in 
this mutant . 

While proteins which bind to the 5 ' UTR of 
chloroplast mRNAs seem likely candidates for 
translational activators, no direct link had been made 
to the body of genetic data prior to the 

characterization of the hfl4 9 mutant as described in the 
present invention. Thus, the identification of RB47, 
the cloning thereof, and the role of RB47 in translation 
activation of psbA are novel and form the basis of the 
mechanisms of the expression cassettes of the present 
invention. 

hfl49 is not likely to be a mutation directly in 
the RB4 7 gene, as Southern and Northern analysis 
indicates that the RB4 7 gene is intact and produces 
normal amounts of RB47 mRNA in the hfl4 9 strain. This 
leaves open the possibility that the loss of RB47 
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protein is the result of a loss of psbA translation, 
rather than the cause of it. Although this is a formal 
possibility, it is highly unlikely given the fact that 
the RB4 7 protein accumulates in other psbA translation 
initiation deficient mutants (e.g. F35, Yohn et al . , 
Mol. Cell Rin1 , 16:3560-1566 (1996)), and that the psbA 
RB60 RNA binding protein still accumulates in the hfl49 
strain. Thus, the hfl4 9 mutation provides strong 
evidence that the RB47 protein is directly involved in 
translational regulation of the chloroplast encoded psjbA 
mRNA. Identification of the specific defect in the 
hfl4 9 mutant should yield further insights into this 
process . 

The dramatic reduction in the amount of psbA mRNA 
associated with ribosomes in the hfl49 mutation suggests 
that RB4 7 is specifically required for ribosomes to 
initiate translation with the psbA mRNA. Although the 
identification of a message specific translational 
activator in the chloroplast has not previously been 
shown, other organellar systems may use similar 
mechanisms for controlling and coordinating gene 
expression, most notably the mitochondria of yeast. In 
particular, the C0X3 mRNA of Saccharomyces cerevisiae is 
translationally regulated by a complex of at least three 
proteins which have been shown genetically (Wiesenberger 

al., Mol. Cell Biol. . 15:3291-3300 (1995)) and 
biochemically (Brown et al . , Mol. Cell o1 , , i 4 - 1 n^R- 
1053 (1994)) to interact with each other and with the 
C0X3 mRNA. One of these proteins (PET122) also 
interacts with the mitochondrial ribosome (Haffter et 
al., Genetics, 127:319-326 (1991); Haffter et al . , 
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Genetics . 12 5 : 495-503 (1990); McMullin et al . , Mol . Cell 
Biol ■ . 10:4590-4595 (1990)), suggesting a model for 
translational activation in which these proteins 
facilitate the initial interaction between the mRNA and 
5 the ribosome. A similar mechanism may be involved with 
RB47, the psbA mRNA and chloroplast ribosomes . 

The identification of RB47 as a poly (A) binding 
protein (PABP) is somewhat unexpected given that 
translation in the chloroplast is generally considered 

10 prokaryotic like, and PABPs have not been identified as 
components of the prokaryotic translation apparatus. 
The chloroplast has 70S ribosomes (as in prokaryotes) 
and the mRNAs encoded by the chloroplast genome do not, 
in general, have poly (A) tails, and often contain 

15 prokaryotic consensus ribosome binding sequences 

(Gillham et al . , Ann. Rev. Genetics . 28:71-93 (1994); 
Harris et al., Microbiol. Rev. . 58:700-754 (1994)). The 
addition of A- rich sequences to the 3 * end of 
endonucleolytic cleavage products of some chloroplast 

20 mRNAs has recently been described (Kudla et al . , Embo 
J. . 15:7137-7146 (1996); Lisitsky et al . , Proc . Natl. 
Acad. Sci. . USA . 93:13398-13403 (1996)), and this seems 
to play a role in degradation of the RNA, as in 
prokaryotes. The identification of a PABP in the 

25 chloroplast indicates that components of the cytoplasmic 
translation machinery may have been appropriated by the 
chloroplast for a similar function. These data also 
indicate that PABPs may function in translational 
regulation in the chloroplast in a manner not previously 

3 0 described for cytoplasmic mRNAs, although the role of 
RB47 in pshA translation seems to fit with the limited 
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information known about the function of PABPs in other 
systems. While no specific biochemical function has yet 
been identified for any member of the PABP family, these 
proteins have been defined as specific RNA binding 
5 proteins with a role in translational regulation. In 
yeast, PABP is essential for viability (Sachs et al . , 
Cell . 45:827-835 (1986); Sachs et al . , Mol . Cell 
Bioi^, 7 : 3268-3276 (1987)), and a temperature sensitive 
(ts) allele of PABP shows that depletion of PABP in 

10 yeast results in inhibition of translation initiation 
and poly (A) tail shortening (Sachs and Davis, Cell . 
58:857-867 (1989)). Further, revertants of this ts 
mutation mapped to a ribosomal protein, suggesting that 
PABP interacts with the ribosome to mediate translation 

15 initiation (Sachs and Davis, id, . (1989)). In addition, 
PABPs have been shown to physically interact with 
ribosomes (Prowell er and Butler, supra , (1996) ) , and 
with eukaryotic initiation factors (eIF4G) (Tarun and 
Sachs, supra , (1996)). RB47 appears to fit these 

2 0 general roles predicted for PABPs, with the exception 
that RB47 shows specific binding to the 5' UTR of the 
psJbA mRNA, and that RB47 is acting in the chloroplast, 
where translation is distinct from that in the cytoplasm 
both spatially and mechanistically. However, the fact 

2 5 that this nuclear encoded, eukaryotic protein has been 

exploited for use in the chloroplast may not be too 
surprising given the bi-directional exchange of genetic 
information between the chloroplast and nucleus (Morden 
et al., Biosystems , 28:75-90 (1992)). 

3 0 Thus, in view of the binding specificity of RB47 to 

the RB4 7 binding site in psbA mRNA, the present 
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invention is unique in describing expression cassettes 
regulated at the translational level. 

From the genetic, biochemical and molecular 
analysis of translational regulation in the chloroplast, 
5 a model for how the psbA mRNA binding proteins act in 
translation initiation and activation of psbA mRNA is 
presently formulated. In this model, nuclear encoded 
proteins, including the PABP homologue RB4 7, are 
transported into the chloroplast. Once in the plastid 

10 these proteins are activated to bind to RNA elements 
found within the 5' UTR of specific mRNAs (Mayfield et 
al., J. Cell Biol. . 127:1537-1545 (1994)). This 
activation of binding is light responsive via the 
reducing potential generated by the light reactions of 

15 photosynthesis (Danon and Mayfield, Embo J. . 13:2227- 
2235 (1994)). The interaction of the translational 
activator proteins and cis-acting RNA elements 
facilitates the initial interaction of the message with 
ribosomal subunits, resulting in increased translation 

20 of the Dl protein from the psbA mRNA, 

Thus, in view of the enhancement of translation by 
the binding of a translation activator protein on a RNA 
element and in view of the cloning of both RB4 7 and RB60 
translation activator proteins, the present invention 

25 contemplates the following aspects related to expression 
systems and uses thereof : isolated nucleic acids 
encoding x~ecombinant proteins and variations thereof; 
the recombinant proteins themselves; use of the RNA 
binding site element in concert with the RB47 and RB60 

3 0 nucleic acids and proteins thereof including 

endogenously expressed counterparts; expression 
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cassettes in which the genetic elements of this 
invention are operably linked; expression systems 
including cells in vitro and in vivo; methods of use 
thereof for expressing a heterologous molecule and for 
5 screening for agonists and antagonists of the 

interaction on which the present invention is based; and 
lastly, kits for use in expression of proteins and 
preparation of RNA transcripts. 

The present invent ion therefore describes the use 

10 of an RB47 binding site nucleotide sequence and a 

coordinate RB47 binding site molecule for the purpose of 
enhancing translation of a desired heterologous coding 
sequence, thereby producing the desired expressed 
molecule for use thereafter. 

15 Based on the translational activation mediated 

through the binding of an RB47 binding site sequence, 
typically a mRNA sequence, the present elements of RB47 
binding site and an RB47 binding site polypeptide are 
therefore referred to as a translational activation 

20 system. The system is capable of further modulation or 
regulation by a polypeptide that regulates the binding 
of a RB47 binding site interaction with its activator 
protein as discussed in the next section. In a 
preferred embodiments, the translational activator 

25 protein of RB47 binding site is RB47 and the translation 
regulatory molecule that regulates the binding of an 
activator to an RB47 binding site is RB60, the latter of 
which is discussed below. 

Thus, a molecule that binds RB4 7 and results in the 

30 translational activation of RB47 binding site thereby 
enhancing translation of a desired mRNA sequence is 
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referred to as a RB47 binding site polypeptide. 
Preferably, the polypeptide is RB47 that is present 
endogenously, i.e., naturally occurring, in a cell such 
that activation of a RB4 7 binding site occurs through an 
5 interaction of an endogenous protein with an exogenously 
provided RB4 7 binding site sequence as further described 
below. Functional RB4 7 protein is found in plant 
chloroplasts as reviewed above. 

In other embodiments, RB47 is a recombinant protein 

10 produced through the expression of the coding sequence 
in a recipient cell as discussed in Section C below. 
Expression of a recombinant RB47 is now possible in view 
of the cloning of the cDNA encoding of RB47 as described 
in the present invention and in Examples 2 and 3 . 

15 Exemplary recombinant RB47 proteins produced by the 

methods of this invention, more completely described in 
the Examples, include mature or processed RB4 7 that is 
approximately a 47 kilodalton (kDa) protein, precursor 
or unprocessed RB4 7 that is approximately a 69 kDa 

20 protein, and a histidine-modif ied RB47 protein that is 

also approximately a 47 kDa protein, the latter of which 
is useful for purification aspects as described in the 
Examples . 

Thus, although the preferred RB4 7 proteins and 
25 nucleic acid compositions are derived from Chlamydomonas 
reinha.rdtii as discussed in the present invention, 
variations at both the amino acid and nucleotide 
sequence level may exist in similar functioning 
molecules isolated from different algae species as well 
3 0 as within differing plant geni . Such variations are not 
to be construed as limiting. For example, allelic 
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variation within a plant species can tolerate a several 
percent difference between isolates of a type of RB47, 
the differences of which comprise non-deleterious 
variant amino acid residues. Thus a protein of about 
5 95% homology, and preferably at least 98% homology, to a 
disclosed RB47 protein is considered to be an allelic 
variant of the disclosed RB47 protein, and therefore is 
considered to be a RB4 7 protein of this invention. 

Thus, the term "homolog" refers to any RB47-like 

10 protein or polypeptide having similar three-dimensional 
structure based on the amino acid residue sequence that 
can be encoded by differing specific nucleic acid 
sequences. In other words, the RB47 species of this 
invention are homologous molecules in view of the amino 

15 acid sequence similarity, the presence of a species 
specific sequence, the overall secondary and tertiary 
structure of the molecule, and the like physical 
parameters . 

Thus as used herein, the phrases "RB47 protein" and 
20 ^^RB47 peptide or polypeptide" refers to a RB47 molecule 
having an amino acid residue sequence that comprises an 
amino acid residue sequence that corresponds, and 
preferably is identical, to a portion of a RB47 protein, 
either produced endogenously or exogenously to produce 

2 5 recombinant proteins, of this invention. 

A recombinant RB47 protein need not necessarily be 
substantially pure, or even isolated, to be useful in 
certain embodiments, although recombinant production 
methods are a preferred means to produce a source for 

3 0 further purification to yield an isolated or 

substantially pure receptor composition. A recombinant 
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RB47 protein can be present in or on a mammalian cell 
line or in crude extracts of a mammalian cell line. In 
other embodiments, a recombinant RB47 protein is 
produced in or on plants or plant cell lines, for 
subsequent use therein to activate the translation of a 
desired coding sequence as described in Section C. 
Preferred expression vector systems for production of 
RB4 7 proteins of this invention in this context are 
described in Section C and in the Examples. 

In the context of the present translational 
activation system of this invention, the presence of a 
RB4 7 binding site sequence is required. Thus, a RB47 
binding site sequence is referred to as a translational 
activation binding domain, the activation of which leads 
to the enhanced or increased translation of a desired 
coding sequence. As previously discussed, endogenous 
activation of the RB4 7 mRNA binding sequence in the psjbA 
gene in green algae chloroplast by RB4 7 results in the 
expression of Dl protein. As further discussed below, 
this translation activation can be further modulated or 
regulated by RB60. The nucleotide sequence in the 5' 
untranslated (5' UTR) end of the psjbA gene containing 
the RB47 binding site is described in Example 3. The 
use of the RB4 7 binding site sequence thus is 
contemplated for use in preparing an expression cassette 
of this invention as further described in Section C and 
more completely in the Examples. Insofar as the binding 
of a RB47 binding site polypeptide to the nucleotide 
sequence of the RB4 7 binding site allows for 
translational activation of an encoding mRNA, 
variations, substitutions, additions, deletions and the 
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like permutations in the nucleic acid sequence of the 
RB4 7 nucleic acid sequence are contemplated for use in 
the present invention. In addition, any functional RB47 
binding site nucleotide sequence is generally positioned 
5 upstream, i.e., 5', to the desired coding nucleotide 
sequence and in relation to the other inserted genetic 
elements including an upstream promoter, transcription 
initiation sites and downstream translation initiation 
sites of a coding region that can be a desired coding 
10 sequence or one of the genetic control elements of the 

invention, such as RB4 7 or RB6 0, as further described in 
Section C. and in the Examples. 

2 . RB60 

15 Light -regulated translation of chloroplast 

mRNAs requires trans-acting factors that interact with 
the 5» untranslated region (UTR) of these mRNAs. The 
present invention describes a protein disulfide 
isomerase (PDI) , also referred to as RB60, that is 

20 localized to the chloroplast and co-purifies with cPABP. 
The cDNA encoding the RB60 protein has now been cloned 
as described herein. As described more fully below, the 
RB60 protein has now been shown to modulate the binding 
of RB47, the cPABP, to the 5» UTR of the psbA mRNA by 

25 reversibly changing the redox status of cPABP using 

redox potential or ADP-dependent phosphorylation. This 
mechanism allows for a simple reversible switch 
regulating gene expression in the chloroplast. 
Moreover, in view of the modulatory properties of RB60 

30 as discussed below, incorporation of RB60 into the 

compositions and methods of this invention are valuable 
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for regulating the expression of a desired gene product 
with the expression cassettes and systems of the present 
invention as described further herein and in the 
Examples . 

5 The present inventors have determined the role of 

RB60 in regulating the binding of RB47 to psJbA mRNA 
containing the RB47 binding site. The work has recently 
been published, Kim and Mayfield, Science . 278:1954-1957 
(1997) , references for the RB60 section herein are 

10 provided in the published paper. As previously 
discussed, synthesis of certain chloroplast 
photosynthetic proteins is activated 50-100 fold in 
response to light exposure without an increase in the 
corresponding mRNA levels, indicating that translation 

15 of chloroplast mRNAs is light-regulated. Genetic 

evidence has shown that nuclear-encoded trans-acting 
factors interact with the 5 ' untranslated region (UTR) 
of chloroplast mRNAs to activate translation of these 
mRNAs in a light -dependent fashion. A set of proteins 

20 (38, 47, 55 and 60 kDa) was identified to bind as a 
complex to the 5' UTR of the psbA mRNA, encoding the 
photosynthetic reaction center protein Dl from the green 
algae Chlamydomonas reinhardtii . Binding of this 
protein complex to the 5 ' UTR of the psbA mRNA 

25 correlates with light -enhanced translation of this mRNA 
under a variety of environmental condition, and in 
mutations deficient in psbA mRNA translation. RNA 
binding activity of the protein complex for the 5 ' UTR 
of the psbA mRNA can be regulated in vitro by at least 

30 two different mechanisms: ADP-dependent phosphorylation 
and changes in redox potential. 
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The present invention and the Kim and Mayfield, 
jjL., (1997) paper describe the cloning of the cDNA 
encoding the 60 kDa psbA mRNA binding protein (RB60) as 
further described in the Examples. The predicted amino 
5 acid sequence of the cloned cDNA is also described 
therein. 

To verify that RB60 is localized to the 
chloroplasts, an immunoblot analysis of isolated pea 
chloroplasts was performed using the C, reinhardtii RB60 

10 antiserum. To confirm that the isolated pea 

chloroplasts were free of cytoplasmic contamination, 
immunoblot analysis was performed with antiserum against 
the large subunit of ribulose bisphosphate carboxylase 
(RuBPCase, located in chloroplast) and antiserum against 

15 the cytoplasmic protein tubuli. RuBPCase antiserum 
recognized proteins from both whole leaf extracts 
(cytoplasm plus chloroplast) and from isolated 
chloroplasts. The tubulin antiserum recognized a 
protein in whole leaf extracts, but not in the 

2 0 chloroplast fraction) , showing that the isolated 

chloroplasts were free of cytoplasmic proteins. The 
protein extracts from isolated pea chloroplasts were 
enriched using heparin-agarose chromatography: 
enrichment was required for immunoblot assays with the 
25 RB60 antiserum as RB6 0 is a minor component within the 
chloroplast. Immunoblot analysis was performed on 
proteins from purified pea chloroplasts, from. C. 
reinha.rdtii cell extracts isolated by heparin-agarose 
chromatography, and on recombinant RB60. A specific 

3 0 signal immunochemically related to RB60 was clearly 

detected at approximately 63 kDa in the pea chloroplast 
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sample. A signal of equal intensity was observed for C. 
reinhardtii proteins and for the recombinant RB60. 

Chloroplast PDI (cPDI) contains the two -CGHC- 
catalytic sites that are involved in the formation, 
5 reduction and isomerization of disulfide bonds 

associated with protein folding. The identification of 
these redox catalytic sites prompted the investigation 
of the role of RB60 in the redox- regulated binding of 
RB47 to the 5* UTR of the psbA mRNA. Both RB60 and 

10 RB47, containing only the four RNA recognition motif 
domains, were expressed as further described in the 
Examples in E. coli as a fusion protein with a histidine 
tag, purified on a Ni-NTA agarose affinity column and 
used for subsequent RNA binding gel mobility-shift 

15 assays. The effect of a reducing agent on RNA binding 
activity of recombinant RB47 (r-RB47) was assessed by 
the addition of DTT (dithiothreitol) in the presence of 
recombinant RB60 {r-RB60) . r-RB47 was preincubated with 
10 mM DTT, a 5-fold excess of r-RB60 alone, or both DTT 

20 plus r-RB60, prior to adding a ^^P-labeled 5 ' -UTR of the 
psbA mRNA, followed by a gel mobility- shift assay. The 
results showed that r-RB47 isolated from coli was in 
an active reduced form so that only a slight enhancement 
of RNA binding activity was obtained with addition of 

25 DTT and r-RB60. 

To determine whether r-RB60 was able to re-activate 
r-RB47 that was in an inactive oxidized form, r-RB47 v/as 
incubated with the oxidant dithionitrobenzoic acid 
(DTNB) for 5 minutes and then dialyzed against 10^ volume 

30 of buffer to remove the oxidant. Oxidation of r-RB47 by 
DTNB completely abolished the binding activity of the 
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protein. Addition of DTT to 1.0 mM partially restored 
the binding capacity of r-RB4 7, and the binding was 
increased three fold by the addition of up to 25 mM DTT. 
With increasing amounts of r-RB60, the binding activity 
5 of r-RB4 7 was increased compared to the samples without 
r-RB60 at every level of DTT tested. When DTT was not 
present in the incubation medium, r-RB60 alone could not 
restore the binding of the oxidized r-RB47 (0 mM DTT) , 
indicating that r-RB60 requires reducing equivalents to 

10 convert the inactive oxidized form of r-RB4 7 to an 
active reduced form. 

Protein disulfide isomerase is known to catalyze 
the formation of disulfide bonds by oxidation of the 
sulfhydryl groups of cysteine residues during protein 

15 folding. To examine whether r-RB60 was also capable of 
oxidative catalysis of the reduced form of r-RB47, GSSG, 
the oxidized form of the thiol tripeptide glutathionine, 
was added to the assay mixture . When GSSG alone was 
added to r-RB47 at up to 5 mM, there was a two fold 

2 0 reduction in binding activity of r-RB47 compared with 

untreated protein. Incubation of r-RB4 7 with both GSSG 
and r-RB60 reduced the binding activity of r-RB47 by 5-6 
fold, indicating that r-RB60 can facilitate the 
conversion of the reduced form of r-RB47 to an inactive 
25 oxidized form under an oxidizing environment. Thus, 
RB60 modulates or in other words regulates the redox 
potential essential for RB47 binding activity. As such, 
RB60 is a regulatory protein useful in regulating the 
expression of a desired coding sequence in reducing and 

3 0 oxidizing environments as supported by the teachings 

described herein. 
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ADP- dependent phosphorylation of RB6 0 has 
previously been shown to reduce binding of the protein 
complex to the 5 ' -UTR of the pshA mRNA, To identify if 
recombinant RB60 can be phosphorylated, r-RB60 was 
5 incubated with heparin-purif led proteins from C. 

reinhardtii in the presence of g-^^p.^^^P. Phosphorylated 
r-RB6 0 was detected among a number of phosphorylated 
proteins in the heparin-purif ied fraction. Purification 
of the incubation mixtures on Ni-NTA resin resulted in 

10 the isolation of phosphorylated r-RB6G. Phosphorylated 
r-RB60 was able to reduce the binding of r-RB47 to the 
5' UTR of the psjbA mRNA, whereas, phosphorylated C. 
reinhardtii proteins eluted from Ni-NTA resin had little 
impact on r-RB47 RNA binding. 

15 It has previously been shown that thioredoxin can 

act as a transducer of redox potential to enhance the 
binding of a protein complex to the psbA mRNA. PDI fits 
well into this scheme as ferredoxin- thioredoxin 
reductase is capable of directly reducing PDI . 

20 In a functional model of RB60 regulation, reducing 

equivalents, generated by photosynthesis, are donated to 
cPDI (RB60) through ferredoxin and 

ferredoxin- thioredoxin reductase and act to catalyze the 
reduction of chloroplast poly (A) binding protein (cPABP) 

25- (RB47) , The reduced form of cPABP is then capable of 
binding to the 5 • UTR of the psbA mRNA to activate 
translation initiation of this mRNA resulting in 
increased synthesis of the Dl protein. This mechanism 
provides a direct link in the chloroplast between the 

3 0 quantity of absorbed light and the rate of synthesis of 
the Dl protein, allowing the replacement of the 
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photo -damaged Dl protein. Protein disulfide isomerase 
has an additional advantage in this scheme in that it 
has greater oxidation potential than thioredoxin, thus 
allowing the off switch (oxidation) when reducing 
5 potential is low. ADP-dependent phosphorylation of 

RB60, that might be triggered by the increased pool of 
ADP during dark growth, can act to reduce the RNA 
binding activity of cPABP by enhancing the oxidative 
catalysis of cPDI over the reductive catalysis, 
10 resulting in decreased translation of the psbA mRNA. 
The data presented here show that a PDI such as RB60 
acts as a regulator of RNA binding activity and hence 
gene expression, and not just as a catalyst for protein 
folding . 

15 The present invention therefore describes the use 

of a protein disulfide isomerase, such as RB60, to 
function as a regulator of the binding of a RB47 binding 
polypeptide to the RB4 7 binding site nucleotide sequence 
for the activation of translation. Thus, in view of the 

2 0 foregoing disclosure, the use of a protein disulfide 

isomerase such as RB6 0 has many applicabilities in the 
context of the present invention, particularly ensuring 
translational control mechanisms for expression of a 
desired coding sequence and production of the encoded 

25 molecule in both oxidizing and reducing environments. 

Based on the translational activation mediated 
through the binding of an RB47 binding site sequence ^ 
typically a mRNA sequence and the regulation by the 
additional element of translational regulator, the 

30 present elements of RB47 binding site, an RB47 binding 
site polypeptide and a RB60 or like molecule are 
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therefore referred to as a regulated translational 
activation system. 

While the invention contemplates the use of any 
molecule that binds to RB47 binding site and any 
5 molecule that functions in accordance to the biological 
role of RB60 as described herein, in a preferred 
embodiments, the translational activator protein of RB47 
binding site is RB47 and the translation regulatory 
molecule that regulates the binding of an activator to 

10 an RB47 binding site is RB60. 

Preferably, a polypeptide that the regulates the 
binding of a separate polypeptide that binds to a RB47 
binding site is present endogenously , i.e., naturally 
occurring, in a cell such that activation and regulation 

15 of translation mediated through a RB47 binding site 

occurs through an interaction of an endogenous protein 
with an exogenously provided RB47 binding site sequence 
as further described in Section C below and in the 
Examples. Functional RB60 protein is found endogenously 

2 0 found in plant chloroplasts as reviewed above. 

In other embodiments, RB60 is a recombinant protein 
produced through the expression of the coding sequence 
in a recipient cell as discussed in Section C below. 
Expression of a recombinant RB60 is now possible in view 
25 of the cloning of the cDNA encoding of RB60 as described 
in the present invention and in Examples 2 and 3 . An 
exemplary recombinant RB60 protein produced by the 
methods of this invention is more completely described 
in the Examples . 

3 0 Thus, although the preferred RB60 protein and 

nucleic acid compositions are derived from Chlamydomonas 
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reinhardtii as discussed in the present invention, 
variations at both the amino acid and nucleotide 
sequence level may exist in similar functioning 
molecules isolated from different algae species as well 
as within differing plant geni. Such variations are not 
to be construed as limiting as previously discussed for 
RB47 compositions. 

C. Recombinant DNA Molecules and E xpression Systems 
that Utilize the RB47 Bin ding Site 
The invention describes several nucleotide 
sequences of particular use in the methods of 
controlling gene expression using the RB47 binding site. 
These sequences include the actual RB47 binding site, 
the sequences which encode the RB47 protein that binds 
to the RB4 7 binding site, the RB60 protein which 
regulates the activity of RB47 protein, and various DNA 
segments, recombinant DNA (rDNA) molecules and vectors 
constructed for expression of these protein or for using 
0 these proteins to control expression of preselected 
structural genes . 

DNA segments of this invention therefore can 
comprise sequences which encode whole structural genes, 
fragments of structural genes, and transcription units 
5 as described further herein. 

A preferred DNA segment is a nucleotide sequence 
which defines an RB47 binding site as defined herein, 
which defines an RB47 protein, RB47 polypeptide or 
biologically active fragment thereof, or which defines 
0 an RB60 protein, RB60 polypeptide or biologically active 
fragment thereof . 
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The amino acid residue sequence of RB47 and of RB60 
are described herein and in the Examples. 

A preferred DNA segment codes for an amino acid 
residue sequence substantially the same as, and 
5 preferably consisting essentially of, an amino acid 
residue sequence or portions thereof corresponding to 
the RB47 or RB60 protein described herein. 
Representative and preferred DNA segments are further 
described in the Examples . 

10 The amino acid residue sequence of a protein or 

polypeptide is directly related via the genetic code to 
the deoxyribonucleic acid (DNA) sequence of the 
structural gene that codes for the protein. Thus, a 
structural gene or DNA segment can be defined in terms 

15 of the amino acid residue sequence, i.e., protein or 
polypeptide, for which it codes. 

An important and well known feature of the genetic 
code is its redundancy. That is, for most of the amino 
acids used to make proteins, more than one coding 

20 nucleotide triplet (codon) can code for or designate a 
particular amino acid residue. Therefore, a number of 
different nucleotide sequences may code for a particular 
amino acid residue sequence. Such nucleotide sequences 
are considered functionally equivalent since they can 

25 result in the production of the same amino acid residue 
sequence in all organisms. Occasionally, a methylated 
variant of a purine or pyrimidine m.ay be incorporated 
into a given nucleotide sequence. However, such 
methylations do not affect the coding relationship in 

3 0 any way. 

A nucleic acid is any polynucleotide or nucleic 
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acid fragment, whether it be a polyribonucleotide of 
polydeoxyribonucleotide, i.e., RNA or DNA, or analogs 
thereof. In preferred embodiments, a nucleic acid 
molecule is in the form of a segment of duplex DNA, i.e, 
5 a DNA segment, although for certain molecular biological 
methodologies, single -stranded DNA or RNA is preferred. 

DNA segments are produced by a number of means 
including chemical synthesis methods and recombinant 
approaches, preferably by cloning or by polymerase chain 

10 reaction (PGR) . DNA segments that encode portions of an 
RB47 or RB60 protein can easily be synthesized by 
chemical techniques, for example, the phosphotriester 
method of Matteucci et al, J . Am . Chem . Soc . . 103:3185- 
3191, 1981, or using automated synthesis methods. In 

15 addition, larger DNA segments can readily be prepared by 
well known methods, such as synthesis of a group of 
oligonucleotides that define the DNA segment, followed 
by hybridization and ligation of oligonucleotides to 
build the complete segment. Alternative methods include 

20 isolation of a preferred DNA segment by PGR with a pair 
of oligonucleotide primers. 

Of course, through chemical synthesis, any desired 
modifications can be made simply by substituting the 
appropriate bases for those encoding the native amino 

25 acid residue sequence. 

Furthermore, DNA segments consisting essentially of 
structural genes encoding an RB4 7 or RB60 protein can be 
subsequently modified, as by site-directed or random 
mutagenesis, to introduce any desired substitutions. 

3 0 1. Glonina RB4 7 and RB60 Genes 

An RB47 or RB60 gene of this invention can be 
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cloned by a variety of cloning methods using 
Chlamydomonas reinhardtii (C. reinhardtii) as a source 
of the genomic DNA or messenger RNA (mRNA) for cloning 
purposes. Cloning these genes can be conducted according 
to the general methods described in the Examples. 

Preferred cloning strategies for isolating a 
nucleic acid molecule that encodes an RB47 or RB60 
protein of this invention are described in the Examples. 

Sources of libraries for cloning an RB47 or RB60 
gene of this invention can include genomic DNA or 
messenger RNA (mRNA) in the form of a cDNA library from 
a tissue believed to express these proteins. The 
preferred tissue is plant chloroplast from C. 
reinhardtii . 

A preferred cloning method involves the preparation 
a C. reinhardtii chloroplast cDNA library using standard 
methods, and preparing the RB47 or RB6 0- encoding 
nucleotide sequence using PGR with oligonucleotide 
primers based on the nucleotide sequences described 
herein for the RB47 or RB60 genes, respectively. 
Alternatively, the desired cDNA clones can be identified 
and isolated from a cDNA or genomic library by 
conventional hybridization methods using a hybridization 
probe based on the sequences described herein. Other 
methods are readily apparent to one skilled in the art. 

2 . Expression Vectors 

In addition, the invention contemplates a 
recombinant DNA molecule (rDNA) containing a DNA segment 
of this invention encoding an RB47 or RB60 protein as 
described herein. A rDNA can be produced by operatively 
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(operably) linking a vector to a DNA segment of the 
present invention . 

As used herein, the term "vector" refers to a DNA 
molecule capable of autonomous replication in a cell and 
5 to which another DNA segment can be operatively linked 
so as to bring about replication of the attached 
segment. A vector adapted for expression of a gene 
product and capable of directing the expression of a 
structural gene is referred to herein as an "expression 

10 vector". Thus, a recombinant DNA molecule is a hybrid 
DNA molecule comprising at least two nucleotide 
sequences not normally found together in nature. 

The choice of vector to which a DNA segment of the 
present invention is operatively linked depends 

15 directly, as is well known in the art, on the functional 
properties desired, e.g., protein expression, and the 
host cell to be transformed, these being limitations 
inherent in the art of constructing recombinant DNA 
molecules. However, a vector contemplated by the 

20 present invention is at least capable of directing the 
replication, and preferably also expression, of a 
structural gene included in DNA segments to which it is 
operatively linked. 

Both prokaryotic and eukaryotic expression vectors 

25 are familiar to one of ordinary skill in the art of 
vector construction and are described by Ausebel, et 
al . , In Current Protocols in Molecular Biology'-, Wiley 
and Sons, New York (1993) and by Sambrook et al . , 
Molecular Cloning: A Laboratory Manual, Cold Spring 

3 0 Harbor Laboratory, (1989) , which reference also 
describes all the general recombinant DNA methods 
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referred to herein. 

In one embodiment, a vector contemplated by the 
present invention includes a procaryotic replicon, i.e., 
a DNA sequence having the ability to direct autonomous 
replication and maintenance of the recombinant DNA 
molecule extrachromosomally in a procaryotic host cell, 
such as a bacterial host cell, transformed therewith . 
Such replicons are well known in the art. In addition, 
those embodiments that include a procaryotic replicon 
also include a gene whose expression confers drug 
resistance to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin or tetracycline. 

Those vectors that include a procaryotic replicon 
can also include a procaryotic promoter capable of 
directing the expression (transcription and translation) 
of a structural gene in a bacterial host cell, such as 
E. coli . transformed therewith. A promoter is an 
expression control element formed by a DNA sequence that 
permits binding of RNA polymerase and transcription to 
occur. Promoter sequences compatible with bacterial 
hosts are typically provided in plasmid vectors 
containing convenient restriction sites for insertion of 
a DNA segment of the present invention. Typical of such 
vector plasmids are pUC8 , pUC9 , pBR322 and pBR32 9 
available from Biorad Laboratories, (Richmond, CA) , 
pRSET available from Invitrogen (San Diego, CA) and pPL 
and pKK223 available from Pharmacia, Piscataway, N.J. 

Expression vectors compatible with eucaryotic 
cells, preferably those compatible with vertebrate 
cells, can also be used to form the recombinant DNA 
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molecules of the present invention. Eucaryotic cell 
expression vectors are well known in the art and are 
available from several commercial sources. Typically, 
such vectors are provided containing convenient 
5 restriction sites for insertion of the desired DNA 

segment. Typical of such vectors are pSVL and pKSV-10 
(Pharmacia) , pBPV-l/pML2d (International 
Biotechnologies, Inc.), pTDTl (ATCC, #31255), pRc/CMV 
(Invitrogen, Inc.), the preferred vector pcDNA3 

10 (Invitrogen) described in the Examples, and the like 
eucaryotic expression vectors. 

An alternative expression system that can be used 
to express a protein of the invention is an insect 
system. In one such system, Autographa calif ornica 

15 nuclear polyhidrosis virus (AcNPV) is used as a vector 
to express foreign genes. The virus grows in Spodoptera 
f rugiperda cells. The polypeptide -encoding nucleotide 
sequence may be cloned into non-essential regions (in 
Spodoptera f ruaiperda for example the polyhedron gene) 

20 of the virus and placed under control of an AcNPV 
promoter (for example the polyhedron promoter) . 
Successful insertion of the polypeptide -encoding 
nucleotide sequence inactivates the polyhedron gene and 
production of non-occluded recombinant virus (i.e., 

25 virus lacking the proteinaceous coat coded for by the 
polyhedron gene) . These recombinant viruses are then 
used to infect cells in v/hich the inserted gene is 
expressed. See Smith et al . , J. Biol . Chem. . 46:584 
(1983); and Smith, U.S. Patent No. 4,215,051. 

30 Mammalian cell systems that utilize recombinant 

viruses or viral elements to direct expression may be 
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engineered. For example, when using adenovirus 
expression vectors, the coding sequence of a polypeptide 
may be ligated to an adenovirus transcription/ 
translation control complex, e.g., the late promoter and 
5 tripartite leader sequence. This chimeric gene may then 
be inserted into the adenovirus genome by in vitro or in 
vivo recombination. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result 
in a recombinant virus that is viable and capable of 

10 expressing the polypeptide in infected hosts (e.g., see 
Logan et al . , Proc. Natl. Acad. Sci., USA . 81:3655-3659 
(1984)). Alternatively, the vaccinia virus 7.5K 
promoter may be used. (e.g., see, Mackett et al . , Proc . 
Natl. Acad. Sci.. USA , 79:7415-7419 (1982); Mackett et 

15 al. , J. Virol. . 49:857-864 (1984); Panicali et al . , 

Proc. Natl. Acad. Scj., USA , 79:4927-4931 (1982)). Of 
particular interest are vectors based on bovine 
papilloma virus which have the ability to replicate as 
extrachromosomal elements (Sarver et al . , Mol . Cell. 

20 Biol . . 1:486 (1981)) . Shortly after entry of this DNA 

into mouse cells, the plasmid replicates to about 100 to 
200 copies per cell. Transcription of the inserted cDNA 
does not require integration of the plasmid into the 
host's chromosome, thereby yielding a high level of 

2 5 expression. These vectors can be used for stable 
expression by including a selectable marker in the 
plasmid, such as the neo gene. Alternatively, the 
retroviral genome can be modified for use as a vector 
capable of introducing and directing the expression of 

30 the polypeptide-encoding nucleotide sequence in host 
cells (Cone et al . , Proc. Natl. Acad. Sci . . USA . 
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81:6349-6353 (1984)). High level expression may also be 
achieved using inducible promoters, including, but not 
limited to, the metallothionine IIA promoter and heat 
shock promoters. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. Rather than 
using expression vectors which contain viral origins of 
replication, host cells can be transformed with a cDNA 
controlled by appropriate expression control elements 
(e.g., promoter and enhancer sequences, transcription 
terminators, polyadenylation sites, etc.), and a 
selectable marker. As mentioned above, the selectable 
marker in the recombinant plasmid confers resistance to 
the selection and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci 
which in turn can be cloned and expanded into cell 
lines . 

For example, following the introduction of foreign 
DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 
selective media. A number of selection systems may be 
used, including but not limited to the herpes simplex 
virus thymidine kinase (Wigler et al . , Cell , 11:223 
(1977) ) , hypoxanthine -guanine phosphoribosyltransf erase 
(Szybalska et al, Proc . Natl. Acad. Sci . , USA . 48:2026 
(1962) ) , and adenine phosphoribosyltransf erase (Lowy et 
al . , Cell . 22:817 (1980)) genes, which can be employed 
in tk", hgprt" or aprt" cells respectively. Also, 
antimetabolite resistance-conferring genes can be used 
as the basis of selection; for example, the genes for 
dhfr, which confers resistance to methotrexate (Wigler 
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et al., Proc. Natl. Acad. Sci . , USA . 77:3567 (1980); 
O'Hare et al . , Proc. Natl. Acad. Sci . . USA . 78:1527 
(1981) ; gpt, which confers resistance to mycophenolic 
acid (Mulligan et al, Proc . Natl . Acad . Sci . , USA . 
5 78:2072, (1981)); neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin et al, J. Mol . 
Biol . . 150:1 (1981)); and hygro, which confers 
resistance to hygromycin (Santerre et al. Gene . 30:147 
(1984)). Recently, additional selectable genes have 

10 been described, namely trpB, which allows cells to 
utilize indole in place of tryptophan; hisD, which 
allows cells to utilize histinol in place of histidine 
(Hartman et al , Proc. Natl. Acad. Sci.. USA . 85:804 
(1988) ) ; and ODC (ornithine decarboxylase) which confers 

15 resistance to the ornithine decarboxylase inhibitor, 
2- (dif luoromethyl) -DL-ornithine, DFMO (McConlogue L., 
In: Current Communications in Molecular Biology, Cold 
Spring Harbor Laboratory ed. , (1987)) . 

In another preferred embodiment, expression vectors 

2 0 compatible for use with plant cells are used to express 

structural genes in plants. Plants provide advantageous 
expression and delivery aspects in that a large supply 
of bulk protein with universal access is readily made 
from which the protein is either isolatable therefrom. 
25 Thus, transgenic plants containing expression vectors 
for encoding a recombinant protein of this invention is 
useful for preparing pol^-peptides of this invention. 

Typical expression vectors useful for expression of 
genes in plants are well known in the art. Typical 

3 0 methods for introducing genes via expression vectors 

into plants include Aarobacterium tumefaciens -mediated 
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transformation, plant virus transf ection, protoplast 
transformation, gene transfer into pollen, injection 
into reproductive organs, injection into immature 
embryos, and direct insertion, a process referred to as 
5 ^^biolystics" . In the case of infection by plant 

viruses, a recombinant protein can be produced at high 
concentrations and isolated at low cost, with the 
genetic stocks being easily maintained for long periods 
of time without passaging through plants. 
10 Preferred plants for such expression include any 

plant for which a compatible expression vector system 
exists, including dicots and monocots . Particularly 
preferred plants include alfalfa, tomato, petunia, soy 
bean, tobacco, corn, wheat, rice, spinach, asparagus, 
15 and the like. Exemplary plant expression vector systems 
for expression of a recombinant protein of this 
invention include those, such as binary vector system 
utilizing Aarobacterium tumefaciens , described in US 
Patent Number 5,202,422 and An et al . , ?l^nt MQlecula,r 
20 Biology Manual , A3:l-19 (1988). Additional plant 

expression systems are described in US Patent Numbers 
5,234,834, the disclosures of which several teachings 
are hereby incorporated by reference. 

25 D. Methods 

The present invention provides for a variety of 
methods using the disclosed r-ecombinant proteins, 
nucleotide sequences, expression cassettes and 
expression systems. In particular, the invention 

30 provides methods for preparing (expressing) an RB47 or 
RB60 protein using the expression systems, methods for 
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controlling (regulating) the expression of a preselected 
coding sequence using the translation regulation 
properties of the RB47 system, methods for screening for 
useful agents which affect the ability of RB47 and/or 
5 RB60 to regulate translation in these systems, and the 
like methods. 

1 . Methods for Preparing a Recombinant Protein 

An RB47 OR RB6 0 protein of this invention can 
10 be prepared by a variety of means, although expression 
using a rDNA expression vector is preferred. Exemplary 
production methods for a recombinant protein are 
described in the Examples . 

In one embodiment, the invention contemplates 
15 methods for the preparation of a recombinant RB4 7 or 

RB60 proteins in their various forms using a nucleotide 
sequence-based expression system. The produced proteins 
are useful in the various embodiments described herein. 
Although the description of expression is limited 

2 0 to specific examples, it is to be understood that the 

expression of proteins is generally characterized, and 
the expression of, for example, full length RB47 protein 
serves as an example of expression of any of a variety 
of forms of RB47 protein, including mature RB47, 
25 processed forms of RB47, biologically active fragments 
of RB47, fusion proteins containing RB47 domains, and 
the like. The descriptions herein apply to various 
forms of RB60 as well. 

The method of preparing a recombinant RB47 or RB60 

3 0 protein comprises providing an expression cassette as 

described herein that contains nucleotide sequences that 
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encode an RB4 7 or RB6 0 protein, or fragment thereof, 
together with nucleotide sequences that provide the 
requisite information for controlling gene expression 
and translation. The provided cassette is introduced 
into a suitable expression medium and maintained under 
conditions and for a time period sufficient for 
expression and translation of the protein product to 
occur. The times and conditions can vary, as is well 
known, depending upon the expression/translation medium 
(e.g., intracellular medium, in vitro expression medium, 
etc.). Nucleotide sequence information required for 
expression and translation are also well known in the 
art and need not be described in detail herein. 

A typical expression system is described herein, in 
5 which the expression cassette is present on a 

recombinant plasmid that has been introduced into a 
microbial host. For example, the expression cassette is 
present in a PET expression plasmid introduced into E. 
colir and the transformed bacterial is cultivated under 
0 growth conditions suitable for growth and expression of 
the expression cassette. Additional expression systems 
include other species of bacterial cells, yeast and 
eucaryotic cells, including mammalian cell expression 
systems, and in vitro expression systems, as are well 
5 known . 

After expression, the expressed RB60 or RB47 
protein is readily isolated from the expression medium 
(i.e., the host cell and cell contents) using standard 
biochemical separation methods to produce an isolated 
10 recombinant protein. Typical isolation methods can 
include disruption of the cell followed by protein 
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fractionations using mechanical, chemical, biological or 
immunological properties of the RB47 or RB60 protein. 
Preferred separation/isolation methods are described in 
the Examples . 

5 Thus, the invention also provides a method for the 

production of recombinant proteins, either as intact 
RB47 or RB6 0 protein, as fusion proteins or as smaller 
polypeptide fragments of RB47 or RB60. The production 
method generally involves inducing cells to express a 

10 recombinant protein of this invention, recovering the 
expressed protein from the resulting cells, and 
purifying the expressed protein so recovered by 
biochemical fractionation methods, using a specific 
antibody of this invention, or other chemical 

15 procedures . Inducing expression of a recombinant 

protein can comprise inserting a rDNA vector encoding an 
RB47 or RB60 protein, or fragment thereof, of this 
invention, which rDNA is capable of expressing the 
structural gene encoding the RB47 or RB60 protein, into 

20 a suitable host cell, and expressing the vector's 
structural gene. 

Thus, to facilitate expression of a recombinant 
protein or fusion protein of the present invention, DNA 
segments encoding either RB4 7 or RB60 as described 

25 herein, or portions thereof, are inserted into an 

expression vector. DNA segments are characterized as 
including a DNA sequence that encodes a recombinant 
protein of this invention, i.e., RB47 or RB60. That is, 
the DNA segments of the present invention are 

3 0 characterized by the presence of some or all of an RB47 
or RB60 structural gene as described herein. Preferably 
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the gene is present as an uninterrupted linear series of 
codons where each codon codes for an amino acid residue 
found in the native protein, i.e., a gene free of 
introns . 

5 

2 . Translati onal Regulation of Expression of a 
Coding Sequence 

In a related embodiment, the invention 
contemplates methods for the controlled expression of a 
0 preselected coding sequence under the regulation of RB47 
or a combination of RB47 and RB60 using the nucleotide 
sequences described herein that define an RB47 binding 
site and the recombinant proteins which bind this 
binding site and regulate translation of adjacent 
5 nucleotide sequences. 

Thus, for example, the RB47 binding site can be 
engineered into an expression cassette as described 
herein to control the expression of a structural gene 
nucleotide sequence which encodes a preselected gene . 
0 The expression cassette contains the following genetic 
elements: (1) a promoter sequence that initiates 
transcription of a gene, (2) an RB47 binding site 
sequence adjacent to and located 3' relative to the 
promoter sequence, (3) a structural protein coding 
5 sequence under the expression control of the promoter, 
and a source of RB4 7 protein to regulate the expression 
of the cassette. 

The promoter can be any of a variety of genetic 
elements as are well known for promotion of gene 
0 transcription. The promoter can be constitutive, 
inducible or repressible, thereby providing further 
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regulation. A preferred promoter is the lac z promoter 
inducible by IPTG, as is well known. Additional 
promoters include the T3 or T7 promoter. 

The RB47 protein can be provided exogenously, as by 
5 addition of isolated protein to an expression medium 
containing the cassette, provided endogenously , as by 
introducing the cassette into a host cell which contains 
endogenous RB47 {e.g., a chloroplast cell that expresses 
RB4 7) or provided by introducing a gene which expresses 
10 the RB47 protein into the expression medium, either in 
combination with the cassette, or substantially 
contemporaneously with the cassette. The RB47 encoding 
gene can be added as a separate plasmid, or can be 
present as a second translation unit on the cassette 
15 which expresses the preselected gene. 

Thus, in one embodiment, the invention comprises 
first forming an expression cassette by operably linking 
the above -identified components, and then introducing 
the expression cassette into a cell or other suitable 
2 0 expression medium. 

Where the expression system can be further 
regulated by RB60, the RB6 0 protein can be added to the 
expression system exogenously from purified recombinant 
protein, provided as an endogenous protein when 
25 expression is carried out in a plant cell, or can be 
provided by expression from a second translation unit. 
The second transcription unit can be present on a 
separate nucleotide sequence, such as a separate plasmid 
capable of expressing RB60 that contains the RB60 
3 0 nucleotide sequence, or present on the same expression 
cassette as a separate translation unit for RB60. 
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Expression cassettes can be introduced into an 
expression medium by any of a variety of means, and 
therefore the invention need not be so limited. For 
example, a variety of cell types can be used including 
5 bacterial, plant, yeast and higher eukaryote, all of 
which have different methods for transformation, 
including transduction, transf ection, electroporation, 
transformation, biolistic bombardment, infection, and 
the like. 

10 These systems provide particular advantages in the 

expression of preselected genes, including structural 
genes, insofar as these systems provide the ability to 
control timing and amounts of expression by specific and 
strong regulators of translation. The advantages will 
15 be apparent to one skilled in the art, but include 

synchronized expression in cell populations, combining 
expression with nutrient supplementation, regulated 
expression in therapeutic, manufacturing and diagnostic 
expression applications, and the like systems. 
20 In one embodiment, the method for expression of a 

desired (preselected) coding sequence comprises first 
the method of preparing an expression cassette having 
the various components described herein, followed by 
introducing the cassette into an expression medium and 
25" maintaining the cassette under condition suitable for 
expression. To that end, the cassette can be prepared 
by any recombinant DNA method, which methods are well 
established in the art, including use of restriction 
enzymes to ligate nucleotide fragments, polymerase chain 
30 reactions (PCR) to isolate, mutate, modify and 

manipulate nucleotide fragments, and cloning sites for 
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insertion of preselected genes. An exemplary method 
involves operably linking the RB47 binding site sequence 
to a cloning site for insertion of a desired coding 
sequence, such that the cloning site is downstream of 
5 the binding site, and linking a second nucleotide 

sequence which encodes an RB47 polypeptide. The method 
can further involve linking a promoter 5* upstream to 
the RB47 binding site to form a transcription unit 
containing from 5» to 3 ' a promoter, a binding site and 

10 a cloning site for inserting the desired coding 
sequence. In a subsequent step for forming an 
expression cassette, the desired coding sequence is 
inserted into the cloning site. 

Other permutations will be apparent to one skilled 

15 in the art. 

3 .qnreenina for Agonists and Antagonists of 
RR47-Mediated Tra nslation 
In another embodiment, the invention 
2 0 contemplates using an expression cassette containing an 
RB47 binding site to screen for agonists and antagonists 
which affect RB47 binding to the RB4 7 binding site, 
thereby identifying useful reagents for further control 
of an RB4 7 -regulated (mediated) translation unit. 
25 The method comprises providing an expression 

cassette according to the invention and having a 
indicator polypeptide as the desired structural gene 
into an expression system (i.e., medium), introducing 
RB47 and the candidate agent, and detecting the amount 
30 of indicator polypeptide expressed, and thereby the 

amount of effect the agent has on the expression system. 
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Controls are typically run in the presence and absence 
of the RB47 protein to demonstrate selectivity of the 
agent, which could be either an agonist or antagonist of 
RB47 activation of translation upon binding to the RB47 

5 binding site. 

Typical indicator polypeptides include enzymes 
which produce detectable substrates, light producing 
enzymes, such as lucif erase, and the like. The RB47 can 
be added in the form of exogenous protein or by 
10 expression off of a nucleotide sequence, as described 
earlier. 

In one embodiment, the expression system is a cell 
capable of supporting expression (transcription and 
translation) and the RB47 is provided in the cell either 
15 by adding protein to the cell or by providing a RB47- 
encoding nucleotide sequence to the cell. 

In a further embodiment, the screening method is 
useful to identify agonists or antagonists of RB6 0 or 
RB60-mediated regulation of RB47-mediated translation, 
20 ie., reagents which effect RB60 rather than RB47 

directly. This embodiment requires that the additional 
component RB60 be included in the screening method as 
described herein for expression using RB60. 

Additional permutations are readily apparent to one 
25 skilled in the art. 

E. Articles of Man ufacture 

The present invention also contemplates an article 
of manufacture comprising one or more of the components 
30 of the present invention. Typically, the article is 
present in the form of a package containing the 
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component or in combination with packaging material. 
The packaging material includes a label or instructions 
for use of the components of the package. The 
instructions indicate the contemplated use of the 
5 component as described herein for the methods or 
compositions of the invention. 

For example, an article of manufacture can comprise 
one or both of the recombinant proteins of the 
invention, RB47 and RB60, in amounts useful in a method 
0 according to the invention. Alternatively, an article 
of manufacture can contain an expression cassette for 
expressing a desired coding sequence, which cassette 
comprises a construction as described herein that 
includes an RB47 binding site, and can optionally 
5 include a cloning site for insertion of a desired coding 
sequence, a promoter for controlling transcription of 
the expression cassette and inserted coding sequence, a 
coding sequence for the RB47 protein and/or the RB60 
protein, and a preselected coding sequence. 
0 Alternatively, the article of manufacture may contain 
multiple nucleotide sequences, such as separate plasmid 
each encoding a different transcription, comprising one 
or more of the desired coding sequence under control of 
the RB4 7 binding site, the RB47 coding sequence and the 
5 RB60 coding sequence. 

The article of manufacture may optionally include 
both an expression cassette and one or both of the 
recombinant proteins RB4 7 and RB60, or may contain a 
cell transformed by one or more of the expression 
0 cassettes of the present invention. 

In a related embodiment, an expression cassette may 
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be used for expressing an RNA transcript containing an 
RB47 binding site, useful for subsequent regulation of 
translation of the transcript by RB47 protein. Such a 
construct can be used in the RNA expression field. 
5 Therefore, the invention contemplates an article of 
manufacture comprising packaging material, and in a 
separate container an expression cassette for expressing 
RNA that includes the RB47 binding site, wherein the 
packaging material includes a label that indicates the 
10 uses of the cassette in producing in vitro RNA 

transcripts. The production of RNA transcripts is well 
known. The article can further contain in separate 
containers components useful in combination with the 
cassette, including polymerases buffers, ribonucleotides 
15 and other reagents for in vitro transcription. 

In these permutations, the components may 
optionally be present in the article of manufacture in 
separate containers . 

20 Ryamples 

The following examples relating to this invention 
are illustrative and should not, of course, be construed 
as specifically limiting the invention. Moreover, such 
variations of the invention, now known or later 
25" developed, which would be within the purview of one 

skilled in the art are to be considered to fall within 
the scope of the present invention hereinafter claimed. 

1. nnnina o f RB47 
30 RB47 protein, 47 kilodalton (kDa) was purified by 

published procedures (Danon and Mayfield, F.mbo J, . 
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10:3993-4001 (1991)). The protein was then digested 
with proteinase Lys-C or trypsin, and the peptides 
separated HPLC and microsequenced (John Lesyk, 
Worchester Foundation for Experimental Biology, 
5 Worcester, MA and Arie Admon, The Protein Center, 

Department of Biology, Technion, Haifa, Israel) . Two 
peptide sequences were obtained ( QYGFVHFEDQAAADR (SEQ ID 
NO 1) and GFGFINFKDAESAA (SEQ ID NO 2) ) . Degenerate 
oligonucleotides were designed based on the reverse 
10 translation of these peptides. For the QYG. . . and 
GFG. . . peptides, the respective oligonucleotide 
sequences were 5 • CAGTACGGYTTCGTBCAYTTCGAGGAYCAGGC3 ' (SEQ 
ID NO 3) and 

5 ' GGAATTCGGYTTCGGYTTCATYAACTTCAAGGAYGCBGAG3 • (SEQ ID NO 
15 4) , where the underline indicates an Eco RI restriction 
site and where Y-C or T; and B=G or T or C. A C. 
reinhardtii cDNA X-gtlO phage library obtained from EMBL 
Laboratories, Heidelberg, Germany, was screened with 
these oligonucleotides using standard methods as 

2 0 described by Sambrook et al . , Molecular Cloning: A 

Laboratory Manual , Cold Spring Harbor Laboratory Press, 
Chapter 8, 1989. One set of duplicate filter lifts was 
probed with each oligonucleotide, and plaques that 
hybridized to both were isolated. Several cDNA clones 
25 that hybridized to oligonucleotides from both peptides 
were identified. Four of these clones were 2.6 
kilobases (kb) in length, the predicted full length of 
the RB47 mRNA. One of these cDNAs was subcloned into an 
E. coli plasmid for sequence determination using an 

3 0 automated sequencer. 

The nucleotide and encoded amino acid sequence of 
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RB47 is also shown in Figures lA-lD (SEQ ID NO 5) . As 
described in Section 2 above, the predicted protein 
sequence from the cloned cDNA contained both the derived 
peptide sequences of RB47 and is highly homologous to 
5 poly (A) binding proteins (PABP) from a variety of 
eukaryotic organisms, 

2 . Clonincr of RB60 

To clone the cDNA encoding the 6 0 kDa psJbA mRNA 

10 binding protein (RB60) , the psibA-specif ic RNA binding 
proteins were purified from light-grown C. reinhardtii 
cells using heparin-agarose chromatography followed by 
pshA RNA affinity chromatography (RAC) . RAC-purif ied 
proteins were separated by two-dimensional 

15 polyacrylamide gel electrophoresis. The region 
corresponding to RB60 was isolated from the PVDF 
membrane. RB60 protein was then digested with trypsin. 
Unambiguous amino acid sequences were obtained from two 
peptide tryptic fragments {WFVDGELASDYNGPR (SEQ ID NO 6) 

20 and (QLILWTTADDLKADAEIMTVFR {SEQ ID NO 7) ) as described 
above for RB47. The calculated molecular weights of the 
two tryptic peptides used for further analysis precisely 
matched with the molecular weights determine by mass 
spectrometry. The DNA sequence corresponding to one 

25 peptide of 22 amino acid residues was amplified by PGR 
using degenerate oligonucleotides, the forward primer 
5 » CGCGGATCCGAYGCBGAGATYATGACS ' (SEQ ID NO 8) and the 
reverse primer 5 ' CGCGAATTCGTCATRATCTCVGCRTC3 • (SEQ ID NO 
9) , where R can be A or G (the other lUPAC nucleotides 

3 0 have been previously defined above) . The amplified 

sequence was then used to screen a X-gtlO cDNA library 

RECTIFIED SHEET (RULE 91) 
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from C. reinhardtii. Three clones were identified with 
the largest being 2.2 kb. Selection and sequencing was 
performed as described for RB47 cDNA. 

The resulting RB60 cDNA sequence is available via 
5 GenBank (Accession Number AF027727) . The nucleotide and 
encoded amino acid sequence of RB60 is also shown in 
Figures 2A-2B (SEQ ID NO 10) . The protein coding 
sequence of 488 amino acid residues corresponds to 
nucleotide positions 16-1614 of the 2413 base pair 
10 sequence. The predicted amino acid sequence of the 

cloned cDNA contained the complete amino acid sequences 
of the two tryptic peptides. The amino acid sequence of 
the encoded protein revealed that it has high sequence 
homology to both plant and mammalian protein disulfide 
15 isomerase (PDI) , and contains the highly conserved 

thioredoxin-like domains with -CysGlyHisCys- (-CGHC-) 
(SEQ ID NO 11) catalytic sites in both the N- terminal 
and C- terminal regions and the -LysAspGluLeu- (-KDEL-) 
(SEQ ID NO 12) endoplasmic reticulum (ER) retention 
20 signal at the C-terminus found in all PDIs. PDI is a 
muti functional protein possessing enzymatic activities 
for the formation, reduction, and isomerization of 
disulfide bonds during protein folding, and is typically 
found in the ER. The first 30 amino acid residues of 
25' RB60 were found to lack sequence homology with the 
N-terminal signal sequence of PDI from plants or 
mammalian cells. However, this region has 
characteristics of chloroplast transit peptides of C. 
reinhardtii. which have similarities with both 
3 0 mitochondrial and higher plant chloroplast pre sequences . 
A transit peptide sequence should override the function 

BBCTIFIED SHEET (RULE 91) 
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of the -KDEL- ER retention signal and target the protein 
to the chloroplast since the -KDEL- signal acts only to 
retain the transported protein in the ER. 

3 , Preparation of psJbA Promoter Sequence and RB47 
Rinding Site Nucleotide Sequence 
The chloroplast psJbA gene from the green 
unicellular alga C. reinhardii was cloned and sequenced 
as described by Erickson et al . , EmbQ_i., 3:2753-2762 
(1984) , the disclosure of which is hereby incorporated 
by reference. The DNA sequence of the coding regions 
and the 5' and 3' untranslated (UTR) flanking sequences 
of the C. reinhardii psbA gene is shown in Figures 3A- 
3C. The psjbA gene sequence is also available through 
GenBank as further discussed in Example 4. The 
nucleotide sequence is also listed as SEQ ID NO 13. The 
deduced amino acid sequence (also listed in SEQ ID NO 
13) of the coding region is shown below each codon 
beginning with the first methionine in the open reading 
frame. Indicated in the 5' non-coding sequence are a 
putative Shine-Dalgarno sequence in the dotted box, two 
putative transcription initiation sites determined by SI 
mapping (SI) and the Pribnow-10 sequence in the closed 
box. Inverted repeats of eight or more base pairs are 
marked with arrows and labeled A-D. A direct repeat of 
31 base pairs with only two mismatches is marked with 
arrows labeled 31. Indicated in the 3' non-coding 
sequence is a large inverted repeat marked by a forward 
arrow and the SI cleavage site marking the 3 ' end of the 
mRNA. Both the 5' and 3' untranslated regions are used 
in preparing one of the expression cassettes of this 

RECHFIED SHEET (RULE 90 



wo 98/31823 



- 62 - 



PCT/US98/0a840 



invention as further described below. 

The 5' UTR as previously discussed contains both 
the pshA promoter and the RB4 7 binding site. The 
nucleotide sequence defining the psbA promoter contains 
5 the region of the psbA DNA involved in binding of RNA 
polymerase to initiate transcription. The -10 sequence 
component of the psbA promoter is indicated by the boxed 
nucleotide sequence upstream of the first SI while the - 
35 sequence is located approximately 35 bases before the 
10 putative initiation site. As shown in Figure 3, the -10 
sequence is boxed, above which is the nucleotide 
position (-100) from the first translated codon. The - 
3 5 sequence is determined accordingly. A psbA promoter 
for use in an expression cassette of this invention ends 
15 at the first indicated SI site (nucleotide position -92 
as counting from the first ATG) in Figure 3 and extends 
to the 5' end (nucleotide position -251 as shown in 
Figure 3) . Thus, the promoter region is 160 bases in 
length. A more preferred promoter region extends at 
20 least 100 nucleotides to the 5' end from the SI site. A 
most preferred region contains nucleotide sequence 
ending at the si site and extending 5' to include the - 
35 sequence, i.e., from -92 to -130 as counted from the 
first encoded amino acid residue (39 bases) . 
25 The psbA RB47 binding site region begins at the 

first SI site as shown in Figure 3 and extends to the 
first adenine base of the first encoded methionine 
residue. Thus, a psbA RB47 binding site in the psbA 
gene corresponds to the nucleotide positions from -91 to 
3 0 -1 as shown in Figure 3. 

The above- identified regions are used to prepare 
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expression constructs as described below. The promoter 
and RB47 binding site regions can be used separately; 
for example, the RB47 binding site sequence can be 
isolated and used in a eukaryotic or prokaryotic plasmid 
with a non-psbA promoter. Alternatively, the entire 
psbA 5' UTR having 251 nucleotides as shown in Figure 3 
is used for the regulatory region in an expression 
cassette containing both the psbA promoter and RB47 
binding site sequence as described below. 



10 



15 



P^>:. p^rP.t-ioP Kvnre s «-inn Vectors an(i Expressign Qf 
Coding f^pgiiences 

A. rr.nsi-Tnictp rnntain inn an DshA Prompter, an 
Rinding Nuc leotide S er pience, a 

no pivpH Hete-rnl ggou s r-nding Sequence, qn m^7- 
T;; p^r^Hing .qeouenf-e and an RB60- EnCQdinq 
Se quence 

Plasmid expression vector constructs, 
alternatively called plasmids, vectors, constructs and 
20 the like, are constructed containing various 

combinations of elements of the present invention as 
described in the following examples. Variations of the 
positioning and operably linking of the genetic elements 
described in the present invention and in the examples 
25" below are contemplated for use in practicing the methods 
of this invention. Methods for manipulating DNA 
elements into operable expression cassettes are well 
known in the art of molecular biology. Accordingly, 
variations of control elements, such as constitutive or 
30 inducible promoters, with respect to prokaryotic or 

eukaryotic expression systems as described in Section C. 
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are contemplated herein although not enumerated. 
Moreover, the expression the various elements is not 
limited to one transcript producing one mRNA; the 
invention contemplates protein expression from more than 
5 one transcript if desired • 

As such, while the examples below recite one or two 
types of expression cassettes, the genetic elements of 
RB47 binding site, any desired coding sequence, in 
combination with RB47 and RB60 coding sequences along 
10 with a promoter are readily combined in a number of 
operably linked permeations depending on the 
requirements of the cell system selected for the 
expression. For example, for expression in a 
chloroplast, endogenous RB47 protein is present 
15 therefore an expression cassette having an RB4 7 binding 
site and a desired coding sequence is minimally required 
along with an operative promoter sequence. 
Overexpression of RB4 7 may be preferable to enhance the 
translation of the coding sequence; in that case, the 
2 0 chloroplast is further transformed with an expression 

cassette containing an RB47-encoding sequence. Although 
the examples herein and below utilize primarily the 
sequence encoding the precursor form of RB47, any of the 
RB4 7 -encoding sequences described in the present 
25 invention, i.e., RB47 precursor, mature RB47 and 

histidine-modif ied RB47 are contemplated for use in any 
expression cassette and system as described herein. To 
regulate the activation of translation, an RB60-encoding 
element is provided to the expression system to provide 
3 0 the ability to regulate redox potential in the cell as 
taught in Section B. These examples herein and below 
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represent a few of the possible permutations of genetic 
elements for expression in the methods of this 
invention . 

In one embodiment, a plasmid is constructed 
5 containing an RB47 binding site directly upstream of an 
inserted coding region for a heterologous protein of 
interest, and the RB47 and RB60 coding regions. 
Heterologous refers to the nature of the coding region 
being dissimilar and not from the same gene as the 
10 regulatory molecules in the plasmid, such as RB47 and 
RB50. Thus, all the genetic elements of the present 
invention are produced in one transcript from the IPTG- 
inducible psbA promoter. Alternative promoters are 
similarly acceptable. 
15 The final construct described herein for use in a 

prokaryotic expression system makes a single mRNA from 
which all three proteins are translated. The starting 
plasmid is any E. coli based plasmid containing an 
origin of replication and selectable marker gene. For 
20 this example, the Bluescript plasmid, pBS, commercially 
available through Stratagene, Inc., La Jolla, CA, which 
contains a polylinker-cloning site and an ampicilin 
resistant marker is selected for the vector. 

The wild-type or native psbA gene (Erickson et al., 
25 Kmbo J. . 3:2753-2762 (1984), also shown in Figure 3. is 
cloned into pBS at the EcoRI and BamHI sites of the 
polylinker. The nucleotide sequence of the psbA gene is 
available on GenBank with the 5- UTR and 3- UTR 
respectively listed in Accession Numbers X01424 and 
30 X02350. The EcoRI site of psbA is 1-5 kb upstream of 
the psbA initiation codon and the BamHI site is 2 kb 
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downstream of the stop codon. This plasmid is referred 
to as pDl . 

Using site-directed PGR mutagenesis, well known to 
one of ordinary skill in the art, an Ndel site is placed 
5 at the initiation codon of psJbA in the pDl plasmid so 
that the ATG of the Ndel restriction site is the ATG 
initiation codon. This plasmid is referred to as 
pDl/Nde. An Nde site is then placed at the initiation 
codon of the gene encoding the heterologous protein of 
10 interest and an Xho I site is placed directly downstream 
(within 10 nucleotides) of the TAA stop codon of the 
heterologous protein coding sequence. Again using site- 
directed mutagenesis, an Xhol site is placed within 10 
nucleotides of the initiation codon of RB47, the 
15 preparation of which is described in Example 2, and an 
NotI site is placed directly downstream of the stop 
codon of RB47. The heterologous coding region and the 
RB4 7 gene are then ligated into pDl/Nde so that the 
heterologous protein gene is directly adjacent to the 
20 RB4 7 binding site and the RB47 coding region is 

downstream of the heterologous coding region, using the 
Xho I site at the heterologous stop codon and the Not I 
site of the pDl polylinker. 

These genetic manipulations result in a plasmid 
25 containing the 5' end of the psJbA gene including the 
promoter region and with the RB4 7 binding site 
immediately upstream of a heterologous coding region, 
and the RB4 7 coding region immediately downstream of the 
heterologous coding region. The nucleotides between the 
3 0 stop codon of the heterologous coding region and the 

initiation codon of the RB4 7 coding region is preferably 
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less than 20 nucleotides and preferably does not contain 
any additional stop codons in any reading frame. This 
plasmid is referred to as pDl/RB47. 

Using site-directed mutagenesis, a NotI site is 
5 placed immediately (within 10 nucleotides) upstream of 
the initiation codon of RB60, the preparation of which 
is described in Example 2, and an Xba I site is placed 
downstream of the RB60 stop codon. This DNA fragment is 
then ligated to the 3 ■ end of the psbA gene using the 
10 Xba I site found in the 3 • end of the psbA gene so that 
the psbA 3' end is downstream of the RB60 coding region. 
This fragment is then ligated into the pDl/RB47 plasmid 
using the NotI and BamHI sites so that the RB60 coding 
region directly follows the RB47 coding region. The 
15 resulting plasmid is designated pDl/RB47/RB60 . 

Preferably there is less then 20 nucleotides between the 
RB47 and RB60 coding regions and preferably there are no 
stop codons in any reading frame in that region. The 
final plasmid thus contains the following genetic 
20 elements operably linked in the 5 ■ to 3 • direction: the 
5' end of the psJbA gene with a promoter capable of 
directing transcription in chloroplasts, an RB47 binding 
site, a desired heterologous coding region, the RB47 
coding region, the RB60 coding region, and the 3- end of 
25 the psbA gene which contains a transcription termination 
and mRNA processing site, and an E. coli origin of 
replication and amplicillin resistance gene. A diagram 
of this plasmid with the restriction sites is shown in 
Figure 4 . 

30 Expression of pDl/RB47/RB60 in E. coli to produce 

recombinant RB47, RB60 and the recombinant heterologous 
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protein is performed as described in Example 4B. The 
heterologous protein is then purified as further 
described . 

Expression cassettes in which the sequences 
5 encoding RB47 and RB60 are similarly operably linked to 
a heterologous coding sequence having the psbA RB47 
binding site as described in Example 3 are prepared with 
a different promoter for use in eukaryotic, such as 
mammalian expression systems. In this aspect, the 
10 cassette is similarly prepared as described above with 
the exception that restriction cloning sites are 
dependent upon the available multiple cloning sites in 
the recipient vector. Thus, the RB47 binding site 
prepared in Example 3 is prepared for directed ligation 
15 into a selected expression vector downstream of the 
promoter in that vector. The RB47 and RB60 coding 
sequences are obtained from the pDl/RB47/RB60 plasmid by 
digestion with Xhol and Xbal and inserted into a 
similarly digested vector if the sites are present. 
20 Alternatively, site-directed mutagenesis is utilized to 
create appropriate linkers. A desired heterologous 
coding sequence is similarly ligated into the vector for 
expression. 

25" B. rr.r.at-^nnfs ront;.inina RR47 NucXeotide 3equenqQ 

1) Pn-rified Recombinant RB4 7 Protein 
In one approach to obtain purifxed 
recombinant RB47 protein, the full length RB47 cDNA 
prepared above was cloned into the E. coli expression 
30 vector pET3A {Studier et al . , MPthodp F;p?^Yinol .> 185:60- 
89 (1990)), also commercially available by Novagen, 
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Inc., Madison, WI and transformed into BL21 coll 
cells. The cells were grown to a density of 0.4 (ODgoo) , 
then induced with 0.5 mM IPTG. Cells were then allowed 
to grow for an additional 4 hours, at which point they 
5 were pelleted and frozen. 

Confirmation of the identity of the cloned cDNA as 
encoding the authentic RB47 protein was accomplished by 
examining protein expressed from the cDNA by immunoblot 
analysis and by RNA binding activity assay. The 

10 recombinant RB47 protein produced when the RB47 cDNA was 
expressed was recognized by antisera raised against the 
C. reinhardtii RB47 protein. The E. coli expressed 
protein migrated at 80 kDa on SDS-PAGE, but the protein 
was actually 69 kDa, as determined by mass spectrometry 

15 of the E. coli expressed protein. This mass agrees with 
the mass predicted from the cDNA sequence. A 60 kDa 
product was also produced in E. coli, and recognized by 
the antisera against the C, reinhardtii protein, which 
is most likely a degradation or early termination 

20 product of the RB47 cDNA. The recombinant RB47 protein 
expressed from the RB47 cDNA is recognized by the 
antisera raised against the C. reinhardtii protein at 
levels similar to the recognition of the authentic C. 
reinhardtii RB47 protein, demonstrating that the cloned 

2 5 cDNA produces a protein product that is immunologically 

related to the naturally produced RB47 protein. In 
order to generate a recombinant equivalent of the 
endogenous native RB47, the location of the 4 7 kDa 
polypeptide was mapped on the full-length recombinant 

3 0 protein by comparing mass spectrometric data of tryptic 

digests of the C. reinhardtii 47 kDa protein and the 
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full-length recombinant protein. Thus, peptide mapping 
by mass spectrometry has shown that the endogenous RB47 
protein corresponds primarily to the RNA binding domains 
contained within the N-terminal region of the predicted 
5 precursor protein, suggesting that a cleavage event is 
necessary to produce the mature 4 7 kDa protein. Thus, 
full-length recombinant RB47 is 6 9 kDa and contains a 
carboxy domain that is cleaved in vivo to generate the 
endogenous mature form of RB47 that is 47 kDa. 

10 To determine if the heterologously expressed RB47 

protein was capable of binding the psbA RNA, the £. coli 
expressed protein was purified by heparin agarose 
chromatography. The recombinant RB4 7 protein expressed 
in E. coli was purified using a protocol similar to that 

15 used previously for purification of RB47 from C. 

reinhardtii. Approximately 5 g of JET. coli cells grown 
as described above were resuspended in low salt 
extraction buffer (10 mM Tris [pH 7.5], 10 mM NaCl, 10 
mM MgClz, 5 mM E-mercaptoethanol ) and disrupted by 

2 0 sonication. The soluble cell extract was applied to a 5 
mL Econo-Pac heparin cartridge (Bio-Rad) which was 
washed prior to elution of the RB47 protein (Danon and 
Mayfield, Embo J . . 10:3993-4001 (1991)). 

The E. coli expressed protein that bound to the 

25 heparin agarose matrix was eluted from the colunm at the 
same salt concentration as used to elute the authentic 
C. reinhardtii RB47 protein. This protein fraction v/as 
used in in vitro binding assays with the psbA UTR. 
Both the 69 and 60 kDa E. coli expressed proteins 

30 crosslinked to the radiolabeled psJbA 5' UTR at levels 
similar to crosslinking of the endogenous RB47 protein. 
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when the RNA/protein complex is subjected to UV 
irradiation. 

Heparin agarose purified proteins, both from the E. 
coli expressed RB47 cDNA and from C. reinhardtii cells, 
were used in an RNA gel mobility shift assay to 
determine the relative affinity and specificity of these 
proteins for the 5» UTR of the psJbA mRNA. The E. coli 
expressed proteins bound to the psbA 5' UTR in vitro 
with properties that are similar to those of the 
endogenous RB47 protein purified from C. reinhardtii , 
RNA binding to both the E. coli expressed and the 
endogenous RB47 protein was competed using either 200 
fold excess of unlabeled psbA RNA or 200 fold excess of 
poly (A) RNA. RNA binding to either of these proteins 
was poorly competed using 200 fold excess of total RNA 
or 200 fold excess of the 5' UTR of the psJbD or psbC 
RNAs. Different forms of the RB47 protein (47 kDa 
endogenous protein vs. the 69 kDa E. coli expressed 
protein) may account for the slight differences in 
0 mobility observed when comparing the binding profiles of 
purified C. reinhardtii protein to heterologously 
expressed RB47. 

The mature form of RB47 is also produced in 
recombinant form by the insertion by PGR of an 
5 artificial stop codon in the RB47 cDNA at nucleotide 
positions 1403-1405 with a stop codon resulting in a 
mature RB47 recombinant protein having 4 02 amino acids 
as shown in Figure 1. An example of this is shown in 
Figure 5 for the production of a recombinant histidine- 
0 modified RB4 7 mature protein as described below. The 
complete RB47 cDNA is inserted into an expression 
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vector, such as pET3A as described above, for expression 
of the mature 4 7 kDa form of the RB47 protein. In the 
absence of the inserted stop codon, the transcript reads 
through to nucleotide position 2066-2068 at the TAA stop 
5 codon to produce the precursor RB47 having the above - 

described molecular weight characteristics and 623 amino 

acid residues. 

Recombinant RB47 is also expressed and purified in 
plant cells. For this aspect, C. reinhardtii strains 
10 were grown in complete media (Tris-acetate-phosphate 
[TAP] (Harris, The Chlawydonas Sourcebook, San Diego, 
CA, Academic Press (1989)) to a density of 5xl0« cells/mL 
under constant light. Cells were harvested by 
centrifugation at 4°C for 5 minutes at 4,000 g. Cells 
15 were either used immediately or frozen in liquid N, for 
storage at -70°C. 

Recombinant RB4 7 protein was also produced as a 
modified RB47 protein with a histidine tag at the amino- 
terminus according to well known expression methods 
20 using pET19-D vectors available from Novagen, Inc., 

Madison, WI . The nucleotide and amino acid sequence of 
a recombinant histidine-modif ied RB47 of the mature 47 
kDa form is shown in Figure 5 with the nucleotide and 
amino acid sequence also listed in SEQ ID NO 14. Thus 
25 the nucleotide sequence of a histidine-modif ied RB47 is 
1269 bases in length. The precursor form of the RB47 
protein is similarly obtained in the expression system, 
both of which are modified by the presence of a 
histidine tag that allows for purification by metal 
30 affinity chromatography. 

The recombinant histidine-modif ied RB47 purified 
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The recombinant histidine-modif ied RB47 purified 
through addition of a poly-histidine tag followed by Ni^^ 
column chromatography showed similar binding 
characteristics as that described for recombinant 
5 precursor RB4 7 described above. 

C. Constructs Containing RB6Q Nucleotide Sequence 
In one approach to obtain purified recombinant 
RB60 protein, the full-length RB60 cDNA prepared above 

10 was cloned into the E. coli expression vector pET3A 
(Studier et al . , Methods Enzymol . . 185:60-89 (1990)), 
also commercially available by Novagen, Inc., Madison, 
WI and transformed into BL21 E. coll cells. The cells 
were grown to a density of 0.4 (OD^oo) / then induced with 

15 0.5 mM IPTG- Cells were then allowed to grow for an 
additional 4 hours, at which point they were pelleted 
and frozen. 

Recombinant histidine-modif ied RB60 was also 
expressed with a pET19-D vector as described above for 

20 RB47 that was similarly modified. Purification of the 

recombinant RB60 proteins was performed as described for 
RB47 thereby producing recombinant RB6 0 proteins for use 
in the present invention. 

The RB60 coding sequence is also mutagenized for 

25' directional ligation into an selected vector for 

expression in alternative systems, such as mammalian 
expression systems. 

D. Constructs Conta ining an RB4 7-EncodinQ 
3 0 .Sequence and an RB60 -Encoding Sequence 

To prepare an expression cassette for encoding 
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both RB47 and RB60, one approach is to digest plasmid 
pDl/RB47/RB60 prepared above with Xhol and Xbal to 
isolate the fragment for both encoding sequences. The 
fragment is then inserted into a similarly digested 
5 expression vector if available or is further mutagenized 
to prepare appropriate restriction sites. 

Alternatively, the nucleotide sequences of RB47 and 
RB60, as described in Example 2, are separately prepared 
for directional ligation into a selected vector. 
10 An additional embodiment of the present invention 

is to prepare an expression cassette containing the RB47 
binding site along with the coding sequences for RB47 
and RB50, the plasmid pDl/RB47/RB60 prepared above is 
digested with Ndel and Xhol to prepare an expression 
15 cassette in which any desired coding sequence having 
similarly restriction sites is direct ionally ligated. 
Expression vectors containing both the RB47 and RB60 
encoding sequences in which the RB47 binding site 
sequence is utilized with a different promoter are also 
20 prepared as described in Example 4A. 

E. rnnstrucl-fi ronta j ni na an RB47 Binding Sitg 
wiif-1f>nt-,ide Se r pi>^nr-P. Insertion .Sites for a 
Df^Rired Hp>tp--roloaoii!=! Codin g Sequence, and an 
25 RR47 -Encoding S equence 

In another permutation, a plasmid or 
expression cassette is constructed containing an RB47 
binding site directly upstream of an inserted coding 
region for a heterologous protein of interest, and the 
30 RB47 coding region. The final construct described 

herein for use in a prokaryotic expression system makes 
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a single mRNA from which both proteins are translated. 

The plasmid referred to as pDl/RB47 is prepared as 
described above in Example 4A. A diagram of this 
plasmid with the restriction sites is shown in Figure 6. 
5 Expression of pDl/RB47 in E. call to produce 

recombinant RB47 and the recombinant heterologous 
protein is performed as described in above. The 
heterologous protein is then purified as further 
described. 

10 To produce an expression cassette that allows for 

insertion of an alternative desired coding sequence, the 
plasmid pDl/RB47 is digested with Ndel and Xhol 
resulting in a vector having restriction endonuclease 
sites for insertion of a desired coding sequence 

15 operably linked to a RB47 binding site and RB47 coding 
sequence on one transcriptional unit. 

F, ronstructs CQntainina an 1^R4 7 Binding Site 
■NTnnleotide .qeguen c p . Insertion Sites for a 
20 neflired Heterologous C o d-ina Sequence, and an 

7^1^4 7 -Encoding Sequence 
In another permutation, a plasmid or 
expression cassette is constructed containing an RB47 
binding site directly upstream of an inserted coding 
2 5 " region for a heterologous protein of interest, and the 
RB60 coding region. The final construct described 
herein for use in a prokaryotic expression system makes 
a single mRNA from which both proteins are translated. 
In this embodiment, a separate construct encoding 
30 recombinant RB47 as described in Example 4B is co- 
transformed into the E. coll host cell for expression. 
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The plasmid referred to as pDl/RB60 is prepared as 
described above for pDl/RB47 in Example 4A with the 
exception that Xhol and Xbal sites are created on RB60 
rather than RB47. 
5 Expression of pDl/RB60 in E. coli to produce 

recombinant RB60 and the recombinant heterologous 
protein is performed as described in above with the 
combined expression of RB4 7 from a separate expression 
cassette. The heterologous protein is then purified as 

10 further described. 

To produce an expression cassette that allows for 
insertion of an alternative desired coding sequence, the 
plasmid pDl/RB60 is digested with Ndel and Xhol 
resulting in a vector having restriction endonuclease 

15 sites for insertion of a desired coding sequence 

operably linked to a RB47 binding site and RB60 coding 
sequence on one transcriptional unit. 

G. ronstruct s Containing RB47 Bindincy Site 
20 Nucleotide Sequence and Heterologous Coding 

Sequences 

1) Expression of Rec ombinant Tetanus Toxin 
jingle Chain Antibody 

The examples herein describe constructs 
25' that are variations of those described above. The 

constructs described below contain an RB47 binding site 
sequence and a heterologous coding sequence. The 
activating protein RB47 was endogenously provided in the 
chloroplast and or plant cell. In other aspects however 
3 0 as taught by the methods of the present invention, the 
chloroplast is further transformed with an RB47- 
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expression construct as described above for 
overexpression of RB47 to enhance translation 

capacities . 

A strain of the green algae Chlamydomonas 
reinhardtii was designed to allow expression of a single 
chain antibody gene in the chloroplast. The 
transgenically expressed antibody was produced from a 
chimeric gene containing the promoter and 5' 
untranslated region (UTR) of the chloroplast psbA gene 
prepared as described above, followed by the coding 
region of a single chain antibody {encoding a tetanus 
toxin binding antibody) , and then the 3 ■ UTR of the psbA 
gene also prepared as described above to provide for 
transcription termination and RNA processing signals. 
This construct is essentially pDl/Nde including a 
heterologous coding sequence having a 3' Xbal 
restriction site for ligation with the 3 • psbA gene and 
is diagramed in Figure 7. 

The psbA- single chain construct was first 
transformed into C. reinhardtii chloroplast and 
transforraants were then screened for single chain gene 
integration. Transformation of chloroplast was 
performed via bolistic delivery as described in US 
Patents 5,545,818 and 5,553,878, the disclosures of 
which are hereby incorporated by reference. 
Transformation is accomplished by homologous 
recombination via the 5' and 3 ■ UTR of the psbA mRNA. 

As shown in Figure 8, two of the transf ormants that 
contained the single chain chimeric gene produced single 
0 chain antibodies at approximately 1% of total protein 
levels. The transgenic antibodies were of the correct 
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size and were completely soluble, as would be expected 
of a correctly folded protein. Few degradation 
products were detectable by this Western analysis, 
suggesting that the proteins were fairly stable within 
5 the chloroplast. To identify if the produced antibody 
retained the binding capacity for tetanus toxin, ELISA 
assays were performed using a mouse -produced Fab, from 
the original tetanus toxin antibody, as the control. 
The chloroplast single chain antibody bound tetanus 
10 toxin at levels similar to Fab, indicating that the 
single chain antibody produced in C. reiniiardtii is a 
fully functional antibody. These results clearly 
demonstrate the ability of the chloroplast to synthesis 
and accumulate function antibody molecules resulting 
15 from the translational activation of an RB47 binding 
site in an expression cassette by endogenous RB47 
protein in the chloroplast . 

2) Rx pression of Bac terial Luc if erase En^vme 
20 Having T wo Subunits 

For the production of molecules that 
contain more than one subunit, such as dIgA and 
bacterial lucif erase enzyme, several proteins must be 
produced in stoichiometric quantities within the 
25 chloroplast. Chloroplast have an advantage for this 

type of production over cytoplasmic protein synthesis in 
that translation of multiple proteins can originate from 
a single mRNA. For example, a dicistronic mRNA having 
5* and 3» Ndel and Xbal restriction sites and containing 
3 0 both the A and B chains of the bacterial lucif erase 

enzyme was inserted downstream of the pshA promoter and 
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5' UTR of the pDl/Nde construct prepared in Example 4A 
above. In this construct, the bacterial LuxAB coding 
region was ligated between the pshA 5- UTR and the psbA 
3' end in an E. coli plasmid that was then transformed 
5 into Chlamydowonas reinhardtii cells as described above 
for expression in the chloroplast. A schematic of the 
construct is shown in Figure 9. Single transf ormant 
colonies were then isolated. A plate containing a 
single isolate was grown for 10 days on complete media 
10 and a drop of the luciferase substrate n-Decyl Aldehyde 
was placed on the plate and the luciferase visualized by 
video-photography in a dark chamber. Both proteins were 
synthesized from this single mRNA and luciferase 
activity accumulated within the chloroplast as shown in 
15 Figure 10 . Some mRNA within plastids contained as many 
as 5 separate proteins encoded on a single mRNA. 

3) Ry pressj on of Dimeric XqA 

To generate dimeric IgA, the construct 
20 shown in Figure 11 is engineered so that the psbA 

promoter and 5- UTR are used to drive the synthesis of 
the light chain and heavy chains of an antibody, and the 
J chain normally associated with IgA molecules. The 
nucleic acid sequences for the dimeric IgA are inserted 
25 " into the RB47 binding site construct prepared in Example 
4A. The construct is then transformed into C. 

, . . - -. , .^-^r- ->"eT r'^^ocr-i bf^d for exoression 
reinhardtii ceiJ-S as ymrvs-O^s^:/ aw^cr^u-u 

of the recombinant dIgA. 

Production of these three proteins within the 
30 plastid allows for the self assembly of a dimeric IgA 
(dIgA) . Production of this complex is monitored in 
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several ways. First. Southern analysis of transgenic 
algae is used to identify strains containing the 
polycistronic chimeric dIgA gene. Strains positive for 
integration of the dIgA gene are screened by Northern 
5 analysis to ensure that the chimeric mRNA is 

accumulating. Western blot analysis using denaturing 
gels is used to monitor the accumulation of the 
individual light, heavy and J chain proteins, and native 
gels western blot analysis will be used to monitor the 
10 accumulation of the assembled dIgA molecule. 

By using a single polycistronic mRNA in the context 
of RB47 regulated translation, two of the potential 
pitfalls in the assembly of multimeric dIgA molecule are 
overcome. First, this construct ensures approximately 
15 stoichiometric synthesis of the subunits, as ribosomes 
reading through the first protein are likely to continue 
to read through the second and third proteins as well, 
second, all of the subunits are synthesized in close 
physical proximity to each other, which increases the 
20 probability of the proteins self assembling into a 
multimeric molecule. Following the production of a 
strain producing dIgA molecules, the production of dIgA 
on an intermediate scale by growing algae in 300 liter 
fermentors is then performed. Larger production scales 
25 are then performed thereafter. 

The foregoing specification, including the specific 

■ t-o illustrative 

embodiments and examples, ±s xno=n^e^ to ~_ i 

of the present invention and is not to be taken as 
limiting. Numerous other variations and modifications 
30 can be effected without departing from the true spirit 
and scope of the invention. 
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What Is Claimed Is: 

1. An expression cassette for expression of a 
desired molecule, which cassette comprises: 

a) an RB47 binding site nucleotide sequence 
5 upstream of a restriction endonuclease site for 

insertion of a desired coding sequence to be expressed; 
and 

b) a nucleotide sequence encoding a 
polypeptide which binds RB47 binding site. 

-LQ 2. The expression cassette of claim 1 further 

comprising a promoter sequence operably linked to and 
positioned upstream of the RB47 binding site nucleotide 
sequence . 

3 . The expression cassette of claim 2 wherein the 
15 promoter sequence is derived from a pshA gene. 

4 . The expression cassette of claim 3 wherein the 
coding sequence is heterologous to the pshA gene. 

5. The expression cassette of claim 1 wherein the 
cassette comprises a plasmid or virus. 

20 6. The expression cassette of claim 1 further 

comprising and operably linked thereto a nucleotide 
sequence encoding RB60. 

7. The expression cassette of claim 1 wherein the 
RB47 binding polypeptide is selected from the group 

25' consisting of RB47, RB47 precursor and a histidine- 

modified RB47. 

8. An expression cassette for expression of a 
desired molecule, which cassette comprises: 

a) an RB47 binding site nucleotide sequence 
3 0 upstream of a restriction endonuclease site for 

insertion of a desired coding sequence to be expressed; 
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and 

b) a nucleotide sequence encoding a 
polypeptide which regulates the binding of RB4 7 to the 

RB47 binding site, 
5 9. The expression cassette of claim 8 wherein the 

regulatory polypeptide is RB60. 

10. A recombinant RB47 protein. 

11. A recombinant RB60 protein. 

12. An isolated nucleotide sequence encoding RB47. 

13 . An isolated nucleotide sequence encoding a 
histidine-modif ied RB47. 

14. An isolated nucleotide sequence encoding RB47 

precursor . 

15 . The nucleotide sequence of claim 12 from 

15 nucleotide position 197 to 1402 in Figures lA-lB and SEQ 
ID NO 5. 

16 . The nucleotide sequence of claim 13 from 
nucleotide position 1 to 1269 in Figures 5A-5B and SEQ 
ID NO 14. 

20 17. The nucleotide sequence of claim 14 shown in 

from nucleotide position 197 to 2065 in Figures lA-lC 

and SEQ ID NO 5 . 

18. An expression cassette comprising the 
nucleotide sequence of claim 12, 13 or 14. 
25' 19. An isolated nucleotide sequence encoding RB60. 

20. The nucleotide sequence of claim 18 from 
nucleotide position 16 to 1614 in Fig^ares 2A-2B and SEQ 
ID NO 10. 

21. An expression cassette comprising the 
3 0 nucleotide sequence of claim 19. 

22. An expression system comprising a cell 
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transformed with the expression cassette of claim 1. 

23. The expression system of claim 22 wherein the 

cell is a plant cell. 

24. The expression system of claim 23 wherein the 
5 plant cell endogenously expresses RB47. 

25. The expression system of claim 23 wherein the 
plant cell endogenously expresses RB60. 

26. The expression system of claim 23 wherein the 
plant cell endogenously expresses RB47 and RB60. 

27. The expression system of claim 22 wherein the 
cell is a eukaryotic cell. 

28. The expression system of claim 22 wherein the 
cell is a prokaryotic cell. 

29. The expression system of claim 22 further 
15 comprising the expression cassette of claim 21. 

30. An expression system comprising a cell 
transformed with the expression cassette of claim 8. 

31. The expression system of claim 29 further 
comprising the expression cassette of claim 18. 

20 32. A cell stably transformed with the expression 

cassette of claim 18 . 

33. A cell stably transformed with the expression 

cassette of claim 21. 

34. A cell stably transformed with the expression 

25 cassette of claims 18 and 21. 

35. The expression cassette of claim 1 further 
comprising an inserted desired coding sequence. 

36. An expression system comprising a cell 
transformed with the expression cassette of claim 35, 

30 wherein the coding sequence is expressed forming the 
desired molecule upon activation of the RB47 binding 



wo 98/31823 



- 84 - 



PCT/US98/0a840 



site with RB47. 

37. The expression system of claim 36 wherein the 
cell is a plant cell endogenously expressing RB47. 

38. The expression system of claim 36 wherein the 
5 cell is stably transformed with the expression cassette 

of claim 21. 

39. An expression system comprising a cell 
transformed with an expression cassette comprising a 
promoter sequence, a RB4 7 binding site sequence, a 

10 desired coding sequence for a molecule, and a nucleotide 
sequence for encoding a polypeptide which binds RB47 
binding site, wherein all sequences are operably linked. 

40. A method of preparing a desired recombinant 
molecule wherein the method comprises cultivating the 

15 expression system of claim 36. 

41. A method of preparing a desired recombinant 
molecule wherein the method comprises cultivating the 
expression system of claim 39. 

42. A method for expressing a desired coding 

2 0 sequence comprising: 

a) forming an expression cassette by 

operably linking: 

1) a promoter sequence; 

2) a RB47 binding site sequence; 
25" 3) a desired coding sequence; and 

4) a nucleotide sequence encoding a 
polypeptide which binds RB47 binding site; and 

b) introducing the expression cassette into 

a cell. 

30 43. The method of claim 42 wherein the cell is a 

plant cell endogenously expressing RB47. 
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44. The method of claim 42 wherein the cell is a 
plant cell endogenously expressing RB60. 

45. The method of claim 42 further comprising 
inducing expression with a promoter inducer molecule. 

5 46. The method of claim 45 wherein the promoter 

inducer molecule is IPTG. 

47. The method of claim 42 wherein the cell is 
transformed with the expression cassette of claim 21. 

48. A method for expressing a desired coding 

0 sequence comprising: 

a) forming an expression cassette by 

operably linking: 

1) a promoter sequence; 

2) a RB47 binding site sequence; and 
5 3) a desired coding sequence; 

and 

b) introducing the expression cassette into 
a plant cell endogenously expressing RB47. 

49. The method of claim 48 wherein the expression 
2 0 cassette further comprises a nucleotide sequence 

encoding RB60. 

50. A method for the regulated production of a 
recombinant molecule from a desired coding sequence in a 
cell, wherein the cell contains the expression cassette 

25" of claim 34, wherein expression of the coding sequence 
is activated by RB47 binding to the RB47 binding site 
thereby producing the recombinant molecule. 

51. A method of forming an expression cassette by 

operably linking: 
3Q a) a RB47 binding site sequence; 

b) a cloning site for insertion of a desired 
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coding sequence downstream of the RB47 binding site 
sequence ; and 

c) a nucleotide sequence encoding a 
polypeptide which binds the RB47 binding site. 
5 52 . The method of claim 51 further comprising a 

promoter sequence operably linked upstream to the RB47 
binding site sequence. 

53 . The method of claim 51 further comprising a 
desired coding sequence inserted into the insertion 

10 site. 

54. A method of screening for agonists or 
antagonists of RB4 7 binding to RB47 binding site, the 
method comprising the steps: 

a) providing a cell expression system 
15 containing: 

1) a promoter sequence; 

2) a RB47 binding site sequence; 

3) a coding sequence for an indicator 

polypept ide ; and 
20 4) a polypeptide which binds to the 

RB47 binding site sequence; 

b) introducing an antagonist or agonist into 

the cell; and 

c) detecting the amount of indicator 
25' polypeptide expressed in the cell. 

55. A method of screening for agonists or 
antagonists of RB60 in regulating RB47 binding to RB47 
binding site, the method comprising the steps: 

a) providing an expression system in a cell 
3 0 containing: 

1) a promoter sequence; 
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2) a RB47 binding site sequence; 

3) a coding sequence for an indicator 

polypeptide; 

4) a polypeptide which binds to the 
5 RB47 binding site sequence; and 

5) a RB60 polypeptide; 

b) introducing an agonist or antagonist into 

the cell; and 

c) detecting the amount of indicator 
10 polypeptide expressed in the cell. 

56. An article of manufacture comprising a 
packaging material and contained therein in a separate 
container the expression cassette of claim 1, wherein 
the expression cassette is useful for expression of a 
15 desired coding sequence, and wherein the packaging 
material comprises a label which indicates that the 
expression cassette can be used for expressing a desired 
coding sequence when the RB47 binding site is activated 
by RB47. 

20 57. The article of manufacture of claim 56 further 

comprising in a separate container the expression 
cassette of claim 18. 

58. The article of manufacture of claim 56 further 
comprising in a separate container the expression 

25 cassette of claim 21. 

59. An article of manufacture comprising a 
packaging material and contained therein in a separate 
container the expression system of claim 22, wherein the 
expression system is useful for expression of a desired 

30 coding sequence, and wherein the packaging material 
comprises a label which indicates that the expression 
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system can be used for expressing a desired coding 
sequence when the RB47 binding site is activated by 
RB47. 

60. An article of manufacture comprising a 

5 packaging material and contained therein in a separate 
container the stably transformed cell of claim 32, 
wherein the cell is useful as an expression system, and 
wherein the packaging material comprises a label which 
indicates that the expression system can be used for 
10 expressing a desired coding sequence when the RB47 
binding site is activated by RB47. 

61. An article of manufacture comprising a 
packaging material and contained therein in a separate 
container the stably transformed cell of claim 33, 

15 wherein the cell is useful as an expression system, and 
wherein the packaging material comprises a label which 
indicates that the expression system can be used for 
expressing a desired coding sequence when the RB47 
binding site is activated by RB47 and regulated by RB60. 

20 62. An article of manufacture comprising a 

packaging material and contained therein in a separate 
container the stably transformed cell of claim 34, 
wherein the cell is useful as an expression system, and 
wherein the packaging material comprises a label which 

25 indicates that the expression system can be used for 
expressing a desired coding sequence when the RB47 
binding site is activated by RB47 and regulated by RB60. 

63 . An article of manufacture comprising a 
packaging material and contained therein in a separate 

30 container the expression cassette of claim 2, wherein 
the expression cassette is useful for expression of a 
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RNA transcript, and wherein the packaging material 
comprises a label which indicates that the expression 
cassette can be used for producing in vitro a RNA 
transcript when the RB47 binding site is activated by 
5 RB47. 

64. The article of manufacture of claim 63 wherein 
the promoter sequence is selected from the group 
consisting of T3 and T7 promoters. 

65. The article of manufacture of claim 63 further 
10 comprising in separate containers a polymerase, a buffer 

and each of four ribonucleotides, reagents for in vitro 
RNA transcription. 
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